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Abstract 



The maximal (or Hilbertian) correlation coefficient between two random variables X 
and Y , denoted by {X : Y}, is the supremum of the | Corr(/"(X),g'(Y))| for real mea- 
surable functions f ,g, where "Corr" denotes Pearson's correlation coefficient. It is a 
classical result that for independent pairs of variables {Xi,Yi)iei, [Xi : Y/} is the supre- 
mum of the {Xi : Yj}. The main goal of this monograph is to prove similar tensorization 
results when one only has partial independence between the (Xj,J:^j);_more generally, 
for random variables {Xi)i^i,{Yj)j^j, we will look for bounds on {Xj : Yj} from bounds 
on the {Xj : Y/}, isljsj. 

My tensorization theorems will imply new decorrelation results for models of sta- 
tistical physics exhibiting asymptotic independence, like the subcritical Ising model. I 
shall prove that for such models, two distant bunches of spins are decorrelated (in the 
Hilbertian sense) uniformly in their sizes and shapes: if / and J are two sets of spins 
such that distiij) > d for all i £ I,j £ J, then one gets a nontrivial bound for [Xj : Yj} 
only depending on d. 

Still for models like the subcritical Ising one, I shall also prove how Hilbertian 
decorrelations may be used to get the spatial CLT or the (strict) positiveness of the 
spectral gap for the Glauber dynamics, via tensorization techniques again. 

Besides all that, I shall finally prove a new criterion to bound the maximal cor- 
relation : ^} between two c-algebras ^ and ^ form a uniform bound on the 
|P[A nB]- P[A]P[B]| / v/P[A]P[B] for all A £ e ^. Such criteria were already 
known, but mine strictly improves those and can moreover be proved to be optimal. 



Introduction 



Overview of the monograph 

This monograph is devoted to the study of Hilbertian correlations (also called "maxi- 
mal correlations" or "p-mixing coefficients"), in particular to showing how this concept 
can be 'tensorized' to yield new results on systems of statistical mechanics exhibiting 
asymptotic independence. I have divided it into six chapters: 

• The first chapter, numbered 'jOj', aims at motivating the study of Hilbertian cor- 
relations and their tensorization. In this chapter, I will recall some classical 
results on the subcritical Ising model, which is a classical model showing asymp- 
totic independence between pairs of spins. When one gets interested in very large 
'bunches' of spins, it is known that asymptotic independence cannot be captured 
by ^-mixing any more, but that, in certain cases at least, it still holds in terms of 
p-mixing. The techniques used so far to establish p-mixing for bunches of spins 
are strongly limited by technical assumptions looking somehow artificial, which 
will motivate studying p-mixing 'for itself and trying to tensorize it. 

• In Chapter [l| I shall recall the definition of the Hilbertian correlation coefficient; 
I shall also recall some classical facts about this concept and give some examples. 
This chapter can be seen as a 'crash course' on p-mixing for the non-specialist 
reader: almost nothing in it is new. 

• In Chapter ^ I shall give some new criteria to bound the Hilbertian correla- 
tion between two cr-algebras, which criteria assume bounds on the (P[A nS] - 
P[A]P[5]) for events A and B belonging to these respective cr-algebras. My 
"strong event sufficient condition", which improves previous results by several 
authors, shall even be shown to be optimal. 

• Chapter |3] is the core of this work: in it I will handle tensorization of Hilbertian 
decorrelations. This chapter begins with a refined version of the concept of corre- 
lation, called "subjective correlation", which is necessary to write the subsequent 
tensorization results. Then I shall state and prove my three main tensoriza- 



tion theorems: Theorem 3.2.2 ('A'^ against 1' theorem) bounds the correlation 



between a 'simple' and a 'vector' variable; Theorem |3.3.1| ('A'^ against M' theo- 
rem) deals with correlation between two vector variables, and Theorem |3.3.10 
('Z against Z' theorem) refines the previous one in the case where certain sym- 
metries are present. Then, I will discuss some refinement and optimality state- 



ments about these theorems; in § 3.9 I will also present a geometric corollary of 
tensorization results which underlines quite well the Hilbertian aspect of maxi- 
mal correlations. 

In Chapter |4[ I will continue to use the tensorization techniques of Chapter [3| 
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but that time instead of proving tensorization results stricto sensu I will turn 
to different types of results, namely the spatial central limit theorem and the 
presence of spectral gap for the Glauber dynamics. 
• Finally, Chapter [5] will present some concrete applications of the results of this 
monograph. For instance, I shall prove new results about decorrelation between 
distant bunches of spins in Ising's model (see Theorem 5.1.1 ); I will also give re- 
sults of the same type for quite general models of statistical mechanics (see e.g. 
Theorems |5.2.10| and |5.3.7[ ), also proving spatial CLT and spectral gap for the 
Glauber dynamics for these models. I will also show how tensorization of Hilber- 
tian correlations can be used to get 'hypocoercivity' results [Theorem |5.4.6[. 



Conventions and notation 

Notation will not always be perfectly rigorous: to make reading easier, it may occur 
sometimes that formalism is slightly loose, or that some writing conventions or as- 
sumptions are implicit. However this shall only be done in situations where adding 
the missing information by the reader is (hopefully) obvious. 

Here is some notation used throughout this text: 



Miscellaneous 

• The symbol N denotes the set of nonnegative integers, including 0. The set of 
positive integers N {0} is denoted by N* . 

• For a,b real numbers, a^b denotes min{a,6}, resp. av6 denotes max{a,6}; a+ 
denotes the positive part of a, i.e. a v 0. 

• For A a set, A*^ denotes the complement set of A (the set of reference shall always 
be clear); 1^ denotes the indicator function of A, that is, the function being 1 
on A and on A'^. 

• For A,B sets, AaB denotes the symmetric difference of A and B, i.e. (A---5)i+J 
(B-^A), where "i+j" means the same as "u", but with underlining that the union 
is disjoint. 

• The identity matrix in dimension n will be denoted by I„. The transpose of a 
matrix A will be denoted by A^. 

• If is a set endowed with a metric dist, then for I, J c 0, distil, J) denotes the 
distance between / and J, that is, distil, J) ■- inf{distii,j): i g I,j e J}. 

• As is customary in physical literature, oc means "proportional to". 

• Whenever / is a set and X a symbol, Xj will be a shorthand for "iXi)iei". 

Probability 

• We will always work on an implicit probability space (H,^) equipped with a 
probability measure P. Sub-cr-algebras of will be merely called "cr-algebras"; I 
will also often write "variable" for "random variable". Unless explicitly specified, 
variables on D, can be valued in any set. 

• If /" is a real random variable, the expectation of f is denoted by E[/"]; its variance 
is denoted by Var(/"); its standard deviation is denoted by Sd(/") := ^/Varif)', ifg is 
another real variable, the covariance between f and g is denoted by Covif,g) ■■= 
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E[fg]-E[f]F,[g]. All that notation extends to the case where f and g are valued 
in some vector space IR''^, except that in that case it refers to vectors or matrices. 
If B is an event with P[fi] > 0, then P[A|fi], E[f\B], Var(/"|B), . . . stand resp. 
for the probability of A, the expectation of f, the variance of /", ... under the 
conditional law dP[-\B] ■■= I5 dP[-]^P[B]. Similarly, if ^ is a a-algebra, P[A|=^], 
E[/"|^], . . . stand for the conditional probability of A, the conditional expectation 
of f, ... w.r.t. ^. 

Concerning conditional expectations, I will actually use two different conven- 
tions: for a cr-algebra, E[/"|^f ] can al so be denoted by /"^. Both conventions can 
be used inside the same formula EE 



If X is a variable on D., the cr-algebra generated by X (that is, the smallest a- 
algebra w.r.t. which X is measurable) is denoted by aiX). If ^ and are a- 
algebras, the cr-algebra generated by ^ and (that is, the smallest cr-algebra 
containing both ^ and is denoted by ^ v "J^, and for an arbitrary number of 
cr-algebras this notation extends into the 00-ary operator V- 
An event Ae is said to have trivial probability, or to be trivial, if P[A] e {0, 1}. 
A cr-algebra is said to be trivial if all its events are trivial. The cr-algebra {0,O}, 
which is trivial under any law P, will be denoted by © and refered to as "the" 
trivial sigma-algebra. 

The Lebesgue measure on IR" will be denoted by dx, "x" being the name of the 
integration variable. For a Borel set A c U"-, jy.^j^dx will sometimes be denoted 
by |A|. 

For C a positive-semidefinite matrix (possibly of dimension 1, in which case it is 
identified with cr^ G jY{C) denotes the law of the centered Gaussian vector 
with covariance matrix C. I will write JY{C) + m to denote the non-centered 
Gaussian vector with variance C and mean m. 



Functional analysis 

• Unless otherwise specified, all the functional spaces considered in this mono- 
graph shall be real. 

• For / an open interval of R and ^ g N u {00}, '^^{1) denotes the subset of functions 
of ^^(/) with compact support. 

• If ju is a nonnegative measure on some measurable space iQ.,S§), L^ijx) denotes 
the set of measurable functions f (up to /i-a.e. equality) such that /q /"(w)^ d^icS) < 
00. If 7 is a countable set, L^(7) denotes the set of functions f :I such that 
Hieifd)^ < 00. If =^ is a cr-algebra, L^i^) denotes the space of =^-measurable 
functions (up to a.s. equality) which are square-integrable w.r.t. P. All these 
spaces are equipped with their natural Hilbertian product <•,•) and the associ- 
ated norm ||'||. 

• For jj, a finite measure, in L^ijS) the constant functions make a line which can 
be identified with K; then, L'^iji) will denote the quotient L^(/j)/[R, equipped 

f*^The use of the first or the second convention will depend on the way we prefer to see the conditional 
expectation of f w.r.t. if it is rather seen as the expectation of f knowing the information of 'S, nota- 
tion E[/'|'^] will be chosen, while if it is more seen like the '^^-measurable function best approximating 
f, we will use the notation /"^. 

'^^^One must not confuse Var(/'|'^), which is the variance of f under the law P['|'^], with Var(/''^) 
which is the (unconditioned) variance of the random variable f'^. One has the well-known identity 
Var(/) = VarC/^) + E[Var(/'|'^^)], which I shall refer to as associativity of variance. 
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with its natural Hilbert structure. In other words, if /" G L^()u) is the projec- 
tion of /"£ Il/Ilz;2 := infill/ -a||i2: a g 1} = (II/IIJ2 - (/■,l/||l|l>^2)^^^- L^l^) 
can also be seen as the subspace of centered functions of L^(/i), i.e. as {f eL^ijj): 
(/",!> = 0}; throughout the monograph we will implicitly switch between both 
interpretations. 

If L : Hi H2 is a linear operator between two Hilbert spaces, then L* : 
H2 Hi denotes the adjoint operator of L, characterized by the relationship 

{L*y,x)Hi = (y,Lx)H2- 

If L : E ^ F is a linear operator between two Banach spaces (not necessarily 
Hilbert) with respective norms \\-\\e and II-IIf, the operator norm of f, denoted 
by Ill/Ill, is defined as sup{||Lx||F : ||x||£ = 1}. 

If L : E ^ E is a bounded linear operator on a Banach space, then p(L) denotes 
the spectral radius of f, that is, p(L) := limy^^oo lll-^'^ 111''^'^* — this limit always exists. 
A column vector (ai)iei will automatically be identified with the corresponding 
element of L^(7). Likewise, a matrix A = {(aij))(ij)eixj will be identified with the 
corresponding linear operator from L^{J) to L^il). 

In our computations we will often use the Cauchy - Schwarz inequality and its 
variants ; when using such an inequality, we will indicate it by writing "CS" 



under the inequality sign concerned. Similarly, "IP" under an equality sign will 
mean that this equality follows from integrating by parts. 
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Chapter 



Motivation 



0.1 Some results on Ising's model 



In this subsection we recall the definition of Ising's model and give two classical results 
on it, namely Theorems 0.1.8 and 0.1.9 In § 0.2[ considerations on these results will 
serve as a motivation to the work of this monograph. 



0.1. a Ising's model 

Ising's celebrated model is a basic model of equilibrium thermodynamics, which repre- 
sents a ferromagnetic material: 

0.1.1 Definition. For n an integer, consider the lattice Z"^ endowed with its usual 
graph structure (each vertex has 2n neighbours), and denote by dist the graph dis- 
tance. Define D = {±1}^ , and for ojeVL, set formally: 

H((x)) = -^ ^ ojiWj. (A) 

dist(i,j)=l 

Then, for T 3^ 0, the Ising model on at temperature T is, formally, a probability 
measure P on H such that P[w] oc exp( - T~^H{(i))). In rigorous terms, saying that P is 
an equilibrium measure for Ising's model means that for all i g Z", for all a){i}c g {±1}'^' , 

P[£(Ji = Wj|w{i)c = W{j}c]oc exp(T"^ <i^j%). (B) 

dist{i,j)=l 

Ising's model and the phase transition it exhibits have been the subject of dozens 
of works; see [21] for an overview. Here we are interested in the subcritical regime: 

0.1.2 Theorem (Subcritical regime, fS^l). There is a Tc < oo (the 'Curie temperature') 
such that the solution of^ is unique for T >Tc . 

For T > Tc one says that they are in the subcritical regime. An interesting feature 
of this regime is that for distant i and j, the random variables and wj are 'almost 
independent'. That phenomenon, called exponential decay of correlations, is stated by 
the following theorem: 
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0.1.3 Theorem (Exponential decay of correlations, (U). For Ising's model on Z" in the 
subcritical regime, 

(i) For all i e Z^ P[wi = -1] = P[a)i = 1] = 1/2. 

(ii) There exists y/>0 and C <oo such that for all i,j ^ Z", 

|E[wiWj]| ^ Cexpi-iifdistiiJ)). (C) 



0.1. b Absence of /3-mixing 



Theorem |0.1.3| states that two distant spins i and j are exponentially decorrelated. 
However, it does not inform us about the dependence between 'bunches' of spins. The 
question is the following: if I and J are two disjoint, distant subsets of Z", to what 
extent are coj and coj independent? 

To answer such a question, the first thing to do is to define a way of measuring 
independence between 'complicated' variables having an arbitrarily large range like coj 
and CO J. The most common choice is the /3-mixing coefficient: 

0.1.4 Definition. 

1. Recall that for /i, v two probability measures on the same measurable space 
(D.,^), the total variation distance between and v is the total mass of both the 
positive and the negative parts of the signed measure v-jU, that is, dist^vilJ^v) - 
sup^e_^ |v(A)-)u(A)|. 

2. If X and Y are two random variables (with arbitrary ranges) defined on the same 
space, then one defines the ^-mixing coefficient between X and Y as 

P(X,Y) ■■- distTyiLawx '^Lawy ,Law(x,Y))- (D) 

Notice that (i{X,Y) actually only depends on the cr-algebras a{X) and a{Y) [71 For- 
mula (1.5)]. The following proposition is immediate: 

0.1.5 Proposition. 

(i) One has always I5(X,Y) e [0, 1], and 

(ii) p(X,Y) - if and only ifX and Y are independent ; 

(Hi) I3(X,Y) = 1 if and only ifLawx ^Lawy and Law{x,Y) are mutually disjoint, 
(iv) IfX' is X-measurable and Y' is Y -measurable, then p(X',Y') ^ I5(X,Y). 

So, (iiX,Y) is a way of measuring 'how much X and Y are correlated'. 

With that tool at hand, decorrelation between bunches of spins in statistical physics 
models has already been thoroughly studied. Concerning Ising's model, there are two 
well-known great results: 

0.1.6 Theorem (Weak mixing property, For Ising's model on Z" in the subcritical 



regime, there exists y/ > and C < oo (the same as in Theorem 0.1.3, ^ such that for all 
disjoint I,J c Z"; 

I3{a)i,ajj) ^ C exp{-i(/distii,j)). (E) 

{ij)elxj 
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0.1.7 Theorem (Complete analyticity, 11151). There exists some Tc ^ < ocp^gucA that 
for T> T'^, Ising's model is completely analj^ical, i.e. there exists y/' > and C > 
such that the following holds: for all K c Z", for all 'boundary' conditions ojk g {+1}^, 
denoting P^^ := 'P\_'\(i)k = (j^k\ Formula holds with Law replaced by Law^^ and y/,C 
replaced resp. by y/' and C. 



Thanks to Theorem 0.1.6 we get an exponential decay of correlation between two 
bunches of spins of fixed size when the distance between these bunches increases. 
However, we cannot say much about decorrelation between bunches of variable size 
which are at fixed distance from each other. For instance, for re = 2 fix x > and define 



II '■- {(0,y): \y \ ^ I}, resp. J/ == {ix,y): \y \ ^ I}. Then Theorem |0 . 1 . 6 1 gives us something 
like: 

Pi^i„wj,)< Cle-'l'\ (F) 

But recall that a /5-mixing coefficient is always bounded by 1; so, for I > e^^lC, (|f]| tell 
us absolutely nothing about the decorrelation between // and J/. 

Though the bound (|e]) is not completely optimal, the previous point is an intrinsic 
shortcoming of jS-mixing coefficients, in the sense that it can be proved that bounds 
like ([f]) must become trivial when I ^ oo: 

0.1.8 Theorem. For all T(. < T < oo, for all x>0, denoting 7 : = {0} x Z and J ■ = {x} xZ, 

one has 

p{aji,wj) = l. (G) 



Proof. Denote io ■= (0,0), resp. j'o == (x,0). As we told in Theorem |0.1.3[ -(p), 
E[wjQ],E[w^Q] - 0. Interpretation of Ising's model as a random-cluster model II21[ § 1.4] 
shows that P[W;„ = > 1/2, so we define 



j-=E[ajioa)jJ>0. 



(H) 



Now, let N be some large integer, fixed for the time being, let p be some large 
integer and define ii,...,ijv, resp. ji,...,jAr, by ik ■= {0,kp), resp. jk ■= {x,kp); denote 
by Pjvp the joint law of (w^^,.. . . By translation invariance, for each k, 

(a>i^,a>/J has the sa me law as (wj^, w^q), which law we denote by Pi. Then when p ^ oo, 



by Theorem 



0.1.6 



Pn,p tends to the law Pn,i 



m 



In other words, Piv,oo is 
the law such that all the {a)ij^,ix)jf^) are independent with PN,ooi(J^ik - V ^^^d =6] = 
(l + jT]9)/4 for all k. Therefore, the value of p{(o)ii^)k,(^jk'>k) under the law Pn,p, which 



by Proposition |0 . 1 . 5| -( |ivl > is a lower bound for p(a)i,ojj), tends to its value under Pjv,c 
when p —>■ oo. This is summed up by the following formula: 



(I) 



To end the proof, we will bound the right-hand side of ([T]l below by a quantity which 
tends to 1 when N ^ oo. Denote by Pjv.oo the product of two the marginals of Piv,oo rela- 
tive resp. to the icoi^)k and the (aiy^)^, so that PPN,co((^ik^k,iiOjkh) = distTyiPN,oo,PN,oo) 

^*ht is not known whether - Tc today, but in general situations weak mixing does not always imply 
complete analyticity. A classical counterexample is Ising's model with external field Il30l § 2]. 

^^^Note that Pn,p takes its values in a space of finite dimension, so there is no ambiguity when speak- 
ing of its convergence. 
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by the very definition of the j6-mixing coefficient. Obviously the expression of Pn,oo is 
the same as the expression of Pjv.oo, but with 7 replaced by in the definition of Pn,oo- 
Under Pn,oo, (^ik^jk'>i^ksiN is a sequence of i.i.d. random variables having a certain 
law with mean 7 > j/2, so that by the law of large numbers, 



N 



Pn,oo 

Similarly, since < j/2, 

Pn,oo 
so that 

distTyiP N, 00, Pn, 00) ^ Pn, 00 
which proves our point. 



r 



0. 



N 



r 



_N y 



N^oo 



1, 



Pn oo[the same] 



(J) 



(K) 



1, (L) 

□ 



O.l.c Presence of p-mixing 



So, Theorem 0.1.8 tells us that, for Ising's model on Z^, there is a 'full' correlation 
between wi and ojj in the sense of )6-mixing. Yet it is well known too that Theo- 
rem 0.1.6 nevertheless implies a Hilbertian form of decorrelation (called "p-mixing". 



cf Remark 1.1.21 between these variables: 



0.1.9 Theorem. For Ising's model on in the subcritical regime, defining as before 
/ = {0} X Z and J - {x} xZ for some x>0, one has for all f e L^iwj) and g g L^icoj): 



|E[/-^]|^e-^^Sd(/-)Sd(^), 



(M) 



where y/ is the same as in Theorem 0.1.6 



Proof. Define the operator 



LHm) 
f 



(N) 



(Recall that f'^^'^J^ is an alternative notation for E[/"|wj], insisting on the its being a 
o"(iOj)-measurable function). Then (|m]| is equivalent to proving that |||P||| ^ e~^^ (see 



§ l.l.cl. Now for all te{0,...,x}, denote (ij(t) ■= '^{i}xz, and for all ^ g {1,. . . ,x}. 



f 



LHoJit)) 



(0) 



Due to the fact that the interactions in Ising's model have only range 1, (W(o) 
• • • ^ a)(x) is a Markov chain, and therefore 



P^n^o- 



(P) 
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Now, by horizontal translation all the L^{a>(t)) can be identified with a common Hilbert 
space H. Then all the are identified with operators on L^(H), and by the translation 
invariance of the model all these operators are actually the same. P is also identified 
with an operator on L^{H), and (|p]) becomes: 

P = n\ (Q) 

But n is self-adjoint because, as the model is invariant by translation and by reflection, 
the Markov chain a»(o) ■■■ ^ a)(x) is stationary and reversible. In particular tt is a 
normal operator, and thus |||P||| = |||7r|||^. So, proving that |||P||| ^ e~^^ is equivalent to 
proving that |||7r||| ^ e~^, which will be our new goal. 



Take C <oo like in Theorem 0.1.6 For I an integer, denote // and Ji to be resp. {0} x 
and {x} X Let /" be a boundec function of L^Cw/^) and denote 



M ■■= ||/"||l«>. By translation, f can also be identified with a function of L^iwj^), which 
is also bounded by M. Now, since 

E[/(t07^)/"(wj,)] - Cov(/"(a)/^), f(a)ji)) = j f{(^ij)f{cjj^)d{Law((^ii^ji)-Law{a)i^)®Law 

(R) 

we can apply (|e]) to Ii and Ji to obtain: 

mnmi)n'^Ji)}\ ^ M^-2Ci2l + lfe-^\ (S) 

In terms of operators, ([S]) means that 

\{f,Pf)LHH)\ ^ 2(2Z + D^M^Ce-^^ (T) 

As the value of x played no particular role to establish (|T]), that formula can be gener- 
alized into 

\{fyf)L2(H)\ ^ mi + ifW'Ce-l" (U) 
for all ^ e N*. Letting t tend to infinity, we obtain that for all I, for all f eL^(co^)r\L°°, 

^h^iloglifynifUe-"^. (V) 



is a dense subset of L^Co*/), so by Lemma |0. 3.1 1 set in appendix. 



we conclude that |||:?r|||^2(^) ^ e ^, which is what we wanted. □ 



0.1.10 Remark. Claim 0.1.8 and Theorem 0.1. 9| adapt straighforwardly with similar 



proofs, to any n^2, replacing / by {0} x Z" ^ and J by {x} x Z" ^ 



0.2 Problematics 



Thanks to Theorem 0.1.9 we see that the Hilbertian concept of p-mixing can reveal 
some independence between infinite bunches of lowly correlated variables in situations 
where the j6-mixing coefficient does not show any independence at all. In the proof we 
gave, p-mixing appeared as a corollary of /5-mixing for finite bunches of spins. What 
additional hypotheses did we need to get our corollary? We used at least the following: 



'^*^In fact here it is superfluous to impose that f is bounded since can only take a finite number of 
values. I wrote the proof like this just to underline that the finiteness of the range of the coi does not 
play any role in the proof 
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To introduce the Markov chain a>(o) ■■■ ^ (^{x), we used that the interactions of 
our model had finite range. 

To identify all the spaces L^ico^t)), we used that / and J had the same shape and 
that one could tile up with a sequence of tiles having that shape (namely, here, 
tiles of the form {t} x Z). 

To say that all the nt were the same modulo that identification, we used the 
translation invariance of the model. 

To state that the stationary Markov chain fc»(o) ^ • • • — ► was reversible, we 
used the reflection invariance of the model. 

To use Lemma 0.3. 1[ we used the exponential decay of correlations. 



All these points make the proof of Theorem 0.1.9 we gave in § |0.1 quite difficult to 



generalize. What, for instance, if we take I and J with arbitrary shapes, just requiring 
that distil, J) ^ X? What if we consider statistical physics models with infinite-range 
interactions? Etc.. The above arguments would not work any more! Yet, we do not 
have the impression that the presence of p-mixing fundamentally relies on the peculiar 
symmetries of the case we treated. . . 

So, here will be the goal of this work: establishing p-mixing estimates by general 
methods. To achieve this goal, I shall try to concentrate on the properties of p-mixing 
'for itself, rather than on its links with other forms of decorrelation. I will carry out a 
thorough study of the p-mixing coefficient, in order to get p-mixing results for 'compli- 
cated' variables from decorrelation results of the same type for more 'basic' variables; 
in other words, I will tensorize Hilbertian decorrelations. It turns out that tensoriza- 
tion for such correlation coefficients gives results which are quite robust as the size of 
the bunches of variables increases. Thanks to this method, I shall obtain fairly new 
decorrelation theorems for various models of statistical physics. 

This monograph is intended to be complete in some sense. I mean, besides the core 
of this work — namely, tensorization results — , I have tried to answer several other 
questions which appeared natural to me concerning Hilbertian decorrelation. This 
includes studying many examples, finding sharp criteria for maximal decorrelation, 
looking at the optimality issues in the tensorization results or showing other applica- 
tions of the tensorization techniques. Though these topics were initially thought as 
'sidework', some of them may be quite interesting for themselves. 



0.3 Appendix: On the norm of self-adjoint operators 

In this appendix we prove the following 

0.3.1 Lemma. Let Lbea self-adjoint operator on a real Hilbert space H, and let C <oo. 
Then, to prove that \\\L\\\ ^ C, it suffices to ensure that 

{xeH: Ihi^ KL*x,x>|^^* ^C} (W) 

k^oo 

is a dense subset ofH. 

Proof. Reasoning by contraposition, we have to show that, for L a self-adjoint operator 
on H, for all C < \\\L \\\ , the set of the xeH such that 

]h^\{L''x,x}\^"'>C (X) 
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contains a non-empty open subset of H. 

Since L is self-adjoint, by the spectral theorem II43[ Theorem 7.18], it is unitarily 
equivalent to the "multiplication by identity" operator M on a space ®aeA^^iPa), for 
some set and pa some Radon measures on IR, that is [in the following equation, the 
variable A is free, so that /"(A) is synonymous with /"]: 



M{Y^fa{X)) = Y.^faa). 



(Y) 



a a 



So we will assume L is of that form. 



One has obviously: 



IIILIII = sup{A ^ 0: (3a G A) (pa([-A, A]^) > 0)}; 



(Z) 



moreover, for all f eH, f 



fa with fa £ 



(pa), 




(AA) 



SO that (observing that, for k even, A^ 3^ VA) 



lim \{L^f,f) 



{A^O: (3aeA)( [ |/-„(A')|2dp„(A') > O)}. (AB) 



J[-X,XY 



Now, for C< IIILIII, the set 




(AC) 



is open because f[_(j dpaiA.) is a continuous function of/", and it is non-empty 

by (|Z]). But @ is satisfied for aWxeU by (|AB]), so U fulfills our quest. □ 
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Chapter 1 

A first approach to Hilbertian 
correlations 



1.1 Definition and first properties 



1.1. a Equivalent definitions 



1.1.1 Definition. Let iD.,SS,P) be a probability space. For two sub-cr-algebras 
of the Hilbertian correlation coefficient (or merely "correlation") between ^ and ^ 
is defined as 

{^:^}:^ sup (AD) 
/eP(^)^{0}Sd(/)Sd(^) 



If the supremum in (AD I is taken over an empty set, that is, if ^ or "^^ is trivial, we 



define this supremum to be 0. 

1.1.2 Remark. : ^} is often called the "maximal correlation coefficient" or "p-mixing 
coefficient" between ^ and and denoted by pi^,^) (see 0). 

1.1.3 Remark. In other words, : is the best ^ g IR+ such that the following refined 
Cauchy-Schwarz inequality holds in the Hilbert space L^iSS): 

yfelH^) K/",^>l^^ll/"llll^l|. (AE) 



Yet another formulation is that : is the cosine of the angle between L^(^) 
andL2(^), seen as subspaces of L^(SS) — this angle being defined as the infimum angle 
between any two non-zero vectors of these respective subspaces. 

If we speak in terms of spaces rather than spaces, : ^} is the best ^ £ IR+ 
such that for all non-constant square-integrable f,g resp. ^ and "S^ -measurable, 

\Corr{f,g)\^k, (AF) 

where Corr(/",^) := Cov(/",^) Sd(/')Sd(^) is the Pearson correlation coefficient be- 
tween f and g. 
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1.1.4 Definition. We say that 3^ and are e-decorrelated, resp. e-correlated, if {3 : 
^ e, resp. : 3^ e. 

1.1.5 Definition. For X and Y random variables (with arbitrary ranges), we will de- 
note {X : Y] for {cr(X) : a{Y)]. 

1.1.6 Remark. One can rewrite Definition II . 1 . 5 1 as 

VI Cov(/(X),g(r)) 

{a : y } = sup — ; r, (AG) 

^Jsd(/(X))sd(^(r))' 

where it is implied that f and g have to be measurable, real, and such that < 
Sd(/(X)),Sd(^(r))<oo. 

More generally, all the questions relative to Hilbertian correlations may be handled 
either in terms of a-algebras or in terms of random variables. In the sequel, we will 
frequently switch implicitly between these two paradigms. 

It is natural to enquire what happens if one deals with complex spaces. In fact 
it does not change anything: 

1.1.7 Proposition ( II44|, Theorem 1.1]). Let 3 and ^ be two a-algebras and let f,g be 

two complex centered variables, measurable w.r.t. resp. 3 and Then, with Sd(/") 
meaning v/E[|/--E[/-]|2], one has: 

I E[/-^] Sd(/-) Sd(g). (AH) 

Proof. I recall the proof for the sake of completeness. Up to multiplying ^ by a well- 
chose n unit complex number, we can assume that E[/"^]g [R+. Then we can apply Defi- 
nition 1.1.1 to the real L? variables £Hc/" and 9^e^, resp. Um/" and 3m^, getting: 



I E[/-^] I = IHc E[/-^] = Erne /■ ^Hc g\ - E[am / Jm^] 

^ : m{^mKzf)^mKtg) + Sd(am/")Sd(am^)) 



^ W : "S} VVar(9ie f) + Var(am/") v/Var($He^) + Var(am^) 

cs 

= {^:<;^}Sd(/-)Sd(^). (AI) 

□ 

Now we turn to a different way of seeing correlation levels. 

1.1.8 Definition. For two cr-algebras, we denote by ;r^^ the 'projection' operator 

yrc^j.: L\9^) - LHm ,,,, 

For (T-algebras, we denote n^qi/gc ...■^3= '■- t^29 ° '^'wac o • • • o ncgg:. 

With this vocabulary at hand, 

1.1.9 Proposition. For two a-algebras, {3 - \\\n<gg^\\\. 
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Proof. Ticgg; is the orthogonal projection from L {^) to L {'^) in the Hilbert space 
L^iSS), so its norm is the cosine of the angle between L^i^) and L^C^S), i.e. : 'S}. □ 

1.1.10 Remark. One has Tig;cs = n%^, since {ncss^f,g) = ^ifg] = {f,n^<sg). Therefore 



the expression \\\n<s^\\\ in Proposition |1. 1.9| can be rewritten into VIW^^'S^ \\\, which is 
also pin^cg^) since n^cg^ is self-adjoint. 



1.1. b Immediate properties 

Having defined Hilbertian correlations, it is now time to study their behaviour. 
The following properties are immediate from Definition 1.1. 1| 

1.1.11 Proposition. For all a-algebras ^, and 

(i) {^■.^} = {^:^}; 

(ii) ^c'^' => {^-.m^l^: ^'}; 
(Hi) {^:^}£[0,1]; 

(iv) :'^} -0 if and only if ^ and are independent; 
(v) If^ is not trivial, then :^} = 1. 

When one is concerned by correlation between variables, it often occurs that some 
of these variables are vector-valued. The following proposition means that it suffices to 
know the behaviour of finite-length vectors to understand the behaviour of all vectors: 

1.1.12 Proposition. Let I, J he possibly infinite sets and let Xi,Yj be vector-valued 
variables. Then, denoting "I' d /" to mean that I' is a finite subset of I, 

{Xj:Yj}= sup {Xp-.Yj,}. (AK) 

Proof This is because \Ji'(^iL^(Xi'), resp. \Jj'(^jL^(Xji), is a dense subset of L^CX/), 
resp. L^(Yj). That property follows by classical approximation arguments like in the 
proof of 1140 [ Theorem 3.14]. See [8, Theorem 3.16(11-3)] for a more detailed proof □ 



1.1. c Operator interpretation 

1.1.13 Proposition. IfX -^Y^Zisa Markov chain, then {X:Z}^{X: Y}{Y : Z}. 

Proof The Markov chain property is equivalent to meaning that nzx - t^zyx, so the 
result is a consequence of the submultiplicativity of operator norms. See also [1391 
§VII-4]. □ 



There is a refined version of Proposition 1.1.13 which is particularly interesting for 
reversible chains: 

1.1.14 Proposition. IfX -^Y ^ Z isa Markov chain, then {X : Z} = \/ pinyzY ° ^yxy)- 
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Proof. Because of the Markov chain property, nxz = ^XY ° ^YZ and nzx = ^ZY ° ^YX- 
Using that for any pair of operators n : Hi H2 and t : H2 Hi one has p{n o j) = 
piron), we get that {X : Z}^ = pinxY) = P^t^xyzyx) = p(^XY°TtYZYx) = P^t^yzyx^t^xy) = 
P^t^yzy^t^yxy)- □ 
1.1.15 Corollary. If ■■■ ^ X-i Xq ^ Xi^ ■■ ■ is a stationary Markov chain so that 



^XiXq and nxoXi commute then for all k £Z-~~ {0}, {Xq : Xk) = {Xq : Xi) 



\k\ 



Proof. Since the chain is stationary, all the X„ have the same law and thus all 
the L^(Xn) can be identified; then the stationarity property is equivalent to saying 
that nXn+iXa = ^XiXo for all n e Z. Thanks to the commutation hypothesis, one can 
write for ^ > 0: 



{Xo :Xk} = ^J p(nxoXkXo) = \J P(^XoX^ ° ^XiXo^ = V ^^^o^i^o^ = V P^^^qXiZq)* = {^0 : ^il^ 

(AL) 

For the case ^ < 0, we use that {Xq : X^} - {X-^ : Xq}. □ 



1.1. d First criteria for decorrelation 

Density sufficient condition 

1.1.16 Proposition. Let X and Y be two random variables valued resp. in E and F. 
Suppose that Law{X, Y) has a density h w.r.t. the product probability Law{X)®Law{Y). 
Then: 

{X:Y}^([ ih-lfdLawxdLawrf^. (AM) 

J ExF 

1.1.17 Remark. The integral expression in ( |AM[ ) is nothing but 2 times the bilinearized 
version of the mutual information 

I(X;Y):= I hloghdLawxdLawy. (AN) 



Yet Example 1.3.3 will show that one does not have {X : F} ^ \/2iX;Y)-^^^ in general. 



Proof. To alleviate notation, denote resp. Pz,Py,P(x,y) for Law{X),Law{Y), 
Law{X ,Y). Let f and g be centered functions being resp. X- and F-measurable. 
Observe first that 



so that 
and thus 



f fix)giy)dPx[x]dPY[y] = {[ fdPx){[ ^dPy ) = x = 0, (AO) 

J ExF JE JF 

nfg] = j fgdP(x,Y) = I hfgdPxdPr = f(h- DfgdPxdPy (AP) 

mfg]\ ^if(h- lfdPxdPYf\ J f^g^dPxdPYf^ (AQ) 



by the Cauchy-Schwarz inequality. But the last factor in the right-hand side of ( AQ I is 



( f f\x)gHy)dPx[x]dPY[y]f^ = ( j f^dPxf\ f g^dPyf^ = Sdif)Sdig), (AR) 



so that (AMI is proved. 



You may also see (HI Theorem 2.5] for an analogous result. □ 



^*^Reversible chains always satisfy this condition since then nx^Xo - ^XoXi- 
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Event necessary condition 



1.1.18 Proposition (event necessary condition). Let ^ and he two a-algebras. If 
:'^}^ e, then for all events Ae^ and B e'^ with respective probabilities p and q. 



\P[AnB]-pq\^E^/pil-p)qil-q). 



(AS) 



In particular, if there exists two non-trivial events A e e'^ which are equivalent 
(in the sense that P[A a5] = 0), then {^-.^S}^ 1. 



[f] 



Proof It follows from (AFI applied to 1^ and I5. 



□ 



l.l.e Independent tensorization 

Now we are turning to the basic tensorization theorem, which will motivate §[3} 

1.1.19 Theorem ( [|T51 Theorem 6.2]). Let I be a set and let Xi,Yj be vectorial variables. 
Suppose all the pairs (Xj, Yj), iel, are independent, then 

{X:Y} = suv{Xi:Yi}. (AT) 

iel 



Proof. The simplest proof of Theorem 1.1.19 relies on the operator interpretation of 



correlations, see e.g. the proof of [45, Theorem 1]. Here however I shall give a proof 
based on decomposing functions of several variables into telescopic sums, for this kind 
of arguments will be used again in the proofs of the more general tensorization theo- 
rems of § [3} 



First, observe that the inequality of ( [AT| ) is trivial, so we only have to prove 
the inequality. We denote Ei ■■= {Xi : Fj}, and to alleviate notation, Xi will implicitly 
stand for an element in the range of Xj, resp. yi for an element in the range of Fj. 



By Proposition [LLT2[ we may assume that / is finite, say / = {1, . . . , A^} for some N £ 
N. Let f and g be resp. Xj-measurable and F/ -measurable centered L^ real functions; 
our goal is to bound above |E[/"^]|. 

For i G {0,...,N}, define ■■= \/ jf:ia{Xj,Yj). I claim that, because of the indepen- 
dence hypothesis, f^' only depends on the values of Xi,...,Xj and not on Yi,...,Yi, 
and similarly that g^'^ only depends on the values oi Yi, . . . ,Yi: one can write indeed 
(in the case of f) 



'^^^The converse is not true: it can occur that but that no non-trivial events of ^ and are 

equivalent. A counterexample is the following: let {Xn)nm be independent Bernoulli{ll2) variables, and 
define independently Y„ - 1-Xn with probability £„ and Y„ - X„ otherwise, where {en)nm is & sequence 
of numbers such that < £„ ^ 1/2 for all n and £„ ?° 0. Then the vectorial variables X and Y obviously 
satisfy {X -.Y}-!, yet it is not hard to prove that no X-measurable non-trivial event is equivalent to a 
Y-measurable one. 
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Now, for i G {1, . . .,N}, define 



fiixi,...,Xi)-= f^'ixi,...,Xi)-E[f\xi,...,Xi-i], 



(AV) 



with a similar definition for g. One has f = T.ifi, resp. g - T.igi, and fi and gi are 
-measurable and centered wr.t. ^i-i (that is, E[/"i|^j_i],E[^j|^j_i] = 0), so 



Var/- = ^Var/-, 



(AW) 



resp. Var^ = Zj Var^j 
We expand: 



E[fg]= ^ E[/-,^,]. 

{i,j)elxl 



(AX) 



In the right-hand side of (AX I, if i ^ j then E[/"i^y] = since if, say, i < j, fi is ^i- 

(AY) 



measurable while gj is centered wr.t. ^j-i ^ ^i- So (AX I turns into: 



EVfg^ = Y.^vfigiy 

iel 



Writing the law of total expectation, 

Wfigi] = j E[figi\xi,yi,...,Xi-i,yi-i]dP[xi,yi,...,Xi-i,yi-i]. (AZ) 

But, as we noticed before, under P[-\xi,yi,. . . ,Xi-i,yi-i], ft only depends on Xj and gi 
only depends on Yi. Moreover, because of the independence property, the law of (Xj,F;) 
is the same under V\_'\x\,yi, . . . ,Xi-i,yi-i\ as under P, so under ^\_'\x\,yi, . . . ,Xi-\,yi-i\, 
fi and gi are centered and -independent. Thus 

\Wfigi'\\^Ei j Sdifi\xi,yi,...,Xi-i,yi-i)Sdigi\xi,yi,...,Xi-i,yi-i)dP[xi,yi,...,Xi-i,yi-i] 



cs 



£iJ j yar{fi\xi,yi,...,Xi-i,yi-i)dP[xi,yi,...,Xi-i,yi-i]\/the same for g 



^eiSdifi)Sdigi^ (BA) 

Summing over i, 
mfg]\ ^ E£iSd(/-OSd(^0 ^ supei-Y.Sd{fi)Sdigi) 

iel i^I iel 



^ supsi ■ J^Yarifi) Y^^arigi) = super Sd(/-)Sd(^), (BB) 

CS iel y iel V iel 

which is the desired bound. □ 



f*^The last inequality is actually an equality, because fi and gi are centered w.rt. ^t-i. 
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1.2 Examples 



1.2.a Finite-ranged variables 

1.2.1 Proposition. Let X and Y be random variables with finite ranges resp. {1,...,N} 
and {1,...,M}, and denote Pa -PIX = a], p^:=P[Y = bl Pa -PIX = a and Y = bl 
Then {X :Y} - \\\Il\\\, where H is the N x M matrix with general entry 



a 



nab = ■ (BC) 



1.2.2 Remark. In particular, if both X and Y have range {1, 2}, using the same notation 
as before, one has 

{X:y} = ^k^, (BD) 



where the right-hand side of (BD I does not depend on the choice of a, 6 £ {1,2}. 



Proof of Proposition 1.2.1 By Proposition 1.1.9, {X -.Y} is the norm of the operator 



TiXY '• L'^iY) L^(X). Here it will be more convenient to work in spaces than in 
spaces, so we rather compute the norm of 



n: LHY) - LHX) 

ng], 



X T.r_n (BE) 



which is obviously the same as IIIttxy |||. 

A function g e L'^iY) can be identified with a M-dimensional vector also denoted 
by g, and similarly ng e L^{X) can be identified with a AT-dimensional vector. De- 

note P := iip%,b e Ix == US aa' Pa)) a, a' G ^^"""^ , Iy iiSwp''))b,b' ^ K'^^'^, 

Ijv == l'-*^' - '^' £ IR-'^. Applying Bayes' formula 5delds that 

ng = Ix^Pg-lNaN)'^Pg. (BF) 
Now, ||^|li2(y) = WifgW, resp. \\ng\\L2^x) = \\lfi^g)\\, so: 

Ul-/^P-lflNilN)''P)g\\ 

{X:Y}^ sup ^ ^- . (BG) 

rl/2 



Performing the change of variables h = Iy g, (BGI becomes {X -.Y} - sup/j^^g Hn^H 

IIAII = lliniii,with 

n = r^^'^pi-^''' - If i^{i^fpi-^'\ (BH) 



which is Equation (BCI indeed. □ 



1.2.3 Remark. With the same kind of proof, there is even a similar proposition to cal- 
culate {X,Y} if either XorY has finite range, provided you know (in the case it is X 
which has finite range) all the P[X = x] and all the 



dPV Y =y\X = x\ dPVY = y\X = x'\ 
d^\Y^y\ 
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1.2.4 Remark. In the case X or Y has range of cardinality 2, applying Proposi- 
tion (1.2.1) yields that {X : Y}^ depends smoothly on Law{X,Y). Yet this is not the 



case in general: in fact, maximal correlations are nothing more than a particular case 
of operator norms (cf § ll.l.cll, and thus they have the same behaviour — they are a 



continuous function of the parameters, but they can have some S^'^ singularity. The 
following example exhibits such a singularity. 

1.2.5 Example. Suppose both X and Y have range {1,2,3} and 



/ 2/9 1/18 



((P[X = a and F = 



1/18 \ 



1/18 2/9 + a 1/18 - a 
1/18 1/18 -a 2/9 + a j 



for a parameter a g [-2/9, 1/18]. Then the matrix n defined by (BCl is 



n 



f 1/3 -1/6 -1/6 ^ 

-1/6 1/3 + 3a -1/6 -3a 
-1/6 -1/6 -3a 1/3 + 3a; 



U 



(1/2 + 6a 


V 



0^ 
1/2 

0; 



U~ 



with 



U 



-2/^/6 1/^' 
1/^ 1/^ 
1/^ 1/^^ 

being orthogonal. So by Proposition |1. 2. 1[ {X:Y]- 1/2 + 6a+. 





1/^ 
-V\f2 



(BJ) 



(BK) 



(BL) 



1.2.b Gaussian variables 



The following theorem, which I will frequently use in the sequel, computes exactly the 
Hilbertian correlation between two jointly Gaussian variables: 

1.2.6 Theorem ( 112711^ ). Let {X,Y) he an (N + M)-dimensional Gaussian vector whose 
covariance matrix writes blockwise 



Var(X,Y) 



(In C 



(BM) 



then {X : Y} 



1.2.7 Remark. In other words, Theorem |1.2.6| tells that in the Gaussian case, the supre- 
mum in (AG I defining {X :Y} can be restricted to linear functions f and g. 



1.2.8 Remark. By a linear change of variables. Theorem 1.2.6 actually allows us to 
compute {X : Y} for any Gaussian vector iX,Y). 



Proof of Theorem 1.2.6 I recall a (sketch of) proof for the sake of completeness. By the 
properties of Gaussian vectors, the law of Y knowing that X - x [I dropped the vector 
arrows] is the normal law jV(\m - C^C) + C^x, and similarly the law of X knowing 
that y = 3/ is the normal law jYiljs! - CC^) + Cy. Consequently, the operator nxYX 
is the generator of the following random walk on IR-'^ (whose equilibrium measure is 
the standard Gaussian law): when one is at x, they jump to a point distributed ac- 
cording to the normal law J^Hn - CC^CC^) + CC^ x. This walk is a multidimensional 
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AR(l)-process (see. II34[ § 2.6]), whose properties are perfectly known; in particular, 
the eigenvalue f of nxYX responsible for its spectral radius will be a linear function, 
so we only have to consider linear f in the supremum (AG). For such f, the optimal g 
will also be linear by the Gaussian nature of the system, so in the end {X : Y} is equal 

to III cm. □ 



1.2.C Miscellaneous examples 
Random conditional laws 

1.2.9 Example. Let < p < n he integers. We consider a random variable {X,Y) for 
which Y has range ^ ■= {!,..., n} and X has range 5^ ■= ^pi^), ^pi^) denoting the 
set of subsets y with cardinality p — so, #3^ - and #^3^ = n — , and we take the 
law of iX,Y) uniform on the pairs (x,y) such that y ex: see Figure [T] 

When considered as operators on spaces, it is obvious that ttxy and nyx are 
characterized by (nxYf)ix)^p~^Lyexf(yl resp. {nYxg)iy) = Qli) Lyexgix), so that 

(nYXYf)iy) = -fiy) + E ^^/"(y')- (BN) 

P y^yPin-1) 

Thus, on L^iY), nyxY is nothing but the scalar operator and therefore 



{X:Y} = ,/^— ^ (BO) 
pin-1) 



by Proposition 1.1.9 and Remark 1.1.10 



Weakly coupled particles 

1.2.10 Proposition. Let Vi and V2 be potentials on U"-, n ^ 1, i.e. the Vi are real- 
valued measurable functions on R"" with f^n e~^'^^^cfo; < 00. For i e {1,2}, denote by Pi 
the probability measure on IR" proportional to e~^'^^^cZx, which is to be thought as the 
law of the position Xj of a particle i subjected to the potential Vi. Denote := Pi i8>P2, 
which is the joint law of(Xi,X2) in absence of interaction. 
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Now let W he an interaction potential on {Wf such that e-[^i(^i) + '^2(x2) + W(xi,X2)] 
is integrahle; denote by P the probability measure on {U"-)^ proportional to 
g-[Vi+V2+W]^^^^^^^ h^IiIqIi ig ffig joint law of {Xi,X2) in presence of interaction po- 
tential W. 

Then, under the law P, 

<™^- 

Proof. The la w P has density h = e~^-^E,»[e~^] w.r.t. P®, whence the result by Propo- 
sition 

□ 



1.1.16 



1.2.11 Remark. Proposition 1.2. 10| gives a rigorous sense to the intuition that two 



weakly coupled particles must have nearly independent positions. This is valid in 
a quite general setting, in particular, W does not have to be bounded. 



Non-reversible Markov chain 



1.2.12 Example. Here is a example showing that the inequality in Proposition 1.1.13 
is strict in general. Consider the stationary Markov chain on {1,2,3} defined by 



P = {{P[Xk+i=a\Xk = b])U 



fO 1/2 1\ 
10 

,0 1/2 0; 



(BQ) 



which has equilibrium measure (2/5,2/5, 1/5). Diagonalizing P shows that 



f2/5 2/5 2/5\ 
2/5 2/5 2/5 

,1/5 1/5 1/5; 



+ 0(2-^^2)^ 



(BR) 



whence {Xk : Xk+t) = 0(2 ^'^) when t +oo by Proposition 1.2.1 Yet {Xk :Xk+i 



1, since the non-trivial events {X}^ - 1} and {Xk+\ - 2} are equivalent (cf. Proposi- 



tion 1.1.181. 



Hyperplanes in Ising's model As I told in Chapter |0| the initial motivation of 
this work was to understand the presence of p-mixing in Ising's model (cf § 0.1. a I; in 
particular, I intended to re-get a result similar to Theorem |0.1.9| by a more 'natural' 



method. That shall be achieved indeed in § 5.1 



1.2.13 Theorem (Theorem |5. 1 . l| -(pjl). For Ising's model on Z" in the completely analyt- 
ical regime, for all disjoint I,J Z", 



{w/ :ojj}^ exp [ - ( v' + o( 1)) distil, J)] , 



(BS) 



where y/' is the same as in Theorem 0.1.7 and where the "oil)" (to be understood "as 
distil, J)^ oo") is uniform in I, J. 
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If we apply that result to the case of parallel 'hyperplanes' of Z" (I mean, sets of the 



form {t} X Z" Formula (BS I looks far less neat than Formula B in Theorem 0.1.9 



That bound can however be improved by using Proposition |1.1.14[ Indeed, as we 



noticed in § |0.1.c[ the states of two parallel hyperplanes are elements of some reversible 
stationary Markov chain. Therefore, applying Corollary 1.1.15| (in which we let k oo), 
we get a result exactly similar to (|m]), except that we have to replace f by t^f' — recall 
that it is not known whether y/' -f. 



1.3 Comparing p-mixing to other measures of depen- 
dence 

The material of this section is classical; most of it can be found for instance in (HI 
§§ 3 & 5]. Here we will say that a sequence of pairs of a-algebras i^"',^"') is p-mixing 
to mean that : ^S^f^ 0. 

1.3.a a-mixing 

1.3.1 Definition. The a-mixing coefficient of two cr-algebras ^ and is 

a(^,'^):= sup|P[AnB]-P[A]P[5]|. (BT) 

Aes^ 
Bern 



Proposition 1.1.18 shows that 'p-mixing implies a-mixing', in the sense that one 



has a{3P, "S) ^ A({^ : ^^}) for some universal function A : [0, 1] ^ [0, 1] with A(p) 0. 

1.3.2 Remark. Saying that the correlation of two variables tends to means that their 
joint law tends in some sense to the product law. When the variables are ranged in 
Polish spaces, a common notion of convergence is weak convergence, that is, conver- 
gence against all bounded continuous function. (31 Theorem 2.2] states that weak 
convergence is implied by a-mixing, hence by p-mixing. The precise statement is the 
following: if (X",y")„ei\i is a sequence of pairs of random variables such that all the X„ 
(resp. Yn) have the same law Law(X) (resp. Law(Y)) in some Polish space E (resp. F), 
then(a(Z",y")"=>°°0) => {Law(X^ ,Y'')''^ Law(X)^Law(Y)). 

On the other hand, the following example shows that a-mixing does not imply p- 
mixing: 

1.3.3 Example. For e e (0, 1/2], define (X^, Y^) in the following way: 

• With probability e, one samples X^ and Y^ independently with common law uni- 
form on [0,£:]; 

• With probability (l-e), one samples X^ and Y^ independently with common law 
uniform on [e, 1]. 

Then for all c > one has {X^ : y^} = 1, since the non-trivial events {X^ ^ e} and {Y^ ^ e} 



are equivalent (cf Proposition |1.1.18| ). However it is easy to show that aiX'^,Y'^) 
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1.3.b j6-mixing 



Recall the definition of the j0-mixing coefficient from the previous chapter [Defini- 
tion |0X1. 

1.3.4 Example. For e e (0,1), consider two random sequences (Xi)ieN and (Yi)ieN de- 
fined in the following way: (Xj)ieM is a sequence of i.i.d. variables with uniform law 
on {±1}, and for each i g N, independently, one sets Yi = Xt with probability e, and with 
probability (1 - e) one chooses Yi uniformly on {±1}. Then all the (Xj, Y^) are i.i.d. with 



P[Xi = 7] and Yi = e^ = {l + yd eVA for all T],e e {±1}, thus {Xi :Yi}^e by Remark 1.2.2 



whence {X -.Y} = e by Theorem |1.1.19[ Yet LawiX, Y) and LawiX) (8> LawiY) are mutu- 
ally singular for all E>0. 

This shows that p-mixing does not imply j6-mixing, and a fortiori that there can be 



no kind of converse to Proposition 1.1.16 



1.3.C Mutual information 



Recall the definition (AN I of mutual information. [8, Theorem 5.3(111)] states that mu- 



tual information controls the ^-mixing coefficient, so Example] 1.3. 4 which shows that 



p-mixing does not imply /5-mixing in general, shows that it does not imply mutual 
information to tend to either. 



Proposition 1.1.16 suggests that, on the other hand, maximal correlation could be 



controlled by mutual information, but that is not true either: in Example 1.3.3 indeed, 
{X' : Y'} 



1 for all e>0, but 



IiX';Y') = elog(e-i) + (1 - e)log((l - e)-^) ' 



0. 



(BU) 



Mutual information measures the quantity of information shared by two random 
variables, which explains intuitively the following property ([12, Theorem 2.5.2]): if 
X ^ F ^ Z is a Markov chain, then I{Y;X,Z) ^ IiX;Y) + IiY;Z). Does a similar 
inequality hold for Hilbertian correlation? In the Gaussian case, the answer is "yes" 



^ {X : Yr + {Y : ZV 



(BV) 



thanks to Theorem |1.2.6[ one gets that 

^ _2 a-{Y :Zf){X:Y}^ + a-{X:Y}^){Y -.ZY^ 
l-{X:y}2{y :Z}2 

But that property does not hold in general, as the following example shows: 

1.3.5 Example. Consider a Markov chain X ^Y ^ Z, where iY,X) and {Y ,Z) have the 



same law, which is the joint law described in Example |1.2.9] — the role of "F" in that 
example being played here by Y in both cases. Fix y define event A as = y" 
and event S as "y g X n Z". Then one computes that P[A] = n~^, while 

1)^ 



P[fi] 



n n(n 



(BW) 



Since A c S, Proposition 1.1.18 then yields that 



{Y :X,Z} ^ V'P[5c]P[A]/P[Ac]P[5] 



in-lf-ip-lf 
[(n-l)^ + in-l)(p-l)^ 



1/2 



(BX) 
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Comparing (BO I and (BXl, one sees that taking re » 1 and 1 « p « n^^'^ makes {X : Y) 
and {Y : Z) arbitrarily close to while \Y : X,Z} gets arbitrarily close to 1. 

1.3.6 Remark. There are similar examples with iX,Y,Z) = iY-i,Yo,Yi) for a reversible 
Markov process iYt)teu 1135 II . 
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Chapter 2 

Event sufficient conditions 



In § |l.l.d| we saw that the maximal correlation coefficient : 'S} controls the dif- 
ference between P[A nB] and P[A]P[5] for A and B two events resp. and 
measurable. A natural question is whether the converse is true, i.e. whether saying 
that P[A nfi] is always close in some sense to P[A]P[fi] implies a control on : '^}. 



We saw in § |1.3.a| that a-mixing does not fit, but maybe stronger conditions of the same 
t5T)e would work. 



In § 2.1 I will present a simple such condition (Theorem 2.1.3 ). This condition de- 
mands |P[A n5] -P[A]P[5]| to be bounded uniformly by C(P[A])0(P[5]) for functions 
C, 9 : [0, 1] U+ sufficiently well behaved. This result, whose proof is rather simple, is 
apparently new. 



Proposition |1.1.18[ however, suggests that the natural condition on events would 
be a uniform control on | P[A nB]- P[A] P[5] | / VPIA"] P[A] VPIB^] PIB], which is out 



of the scope of Theorem |2.1.3[ Bradley [j5l proved in 1983 that that condition was 
indeed sufficient to get p-mixing. His result was improved in the next few years (see 
for instance the bound of [ 10 1), but the optimal bound was remaining unknown, though 



its value was being conjectured. In § |2.2[ , I shall prove this optimal bound. My method, 
different from the techniques of DSmOl, relies on the analysis of the spectral properties 
of an operator linked to a law which I call the "Chogosov law", whose study is proceeded 



to in § 2.2.b 



2.1 Weak event sufficient condition 



To state our next result we need some functional analysis reminders first: 

2.1.1 Definition. On the space '^^^(0, 1) of compactly supported factions of '^^°°(0, 1), 
one defines the scalar product 



{(p,nf)Hi 



I (p'{x)if/'{x)dx. 
Jo 



(BY) 



'tf^(0, 1) endowed with is a pre-Hilbert space; its completion is denoted 

byHi(0,l). 
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Recall that elements of ^^(0, 1) may be seen as ordinary functions: 

2.1.2 Lemma (Sobolev, BH Theorem 4.12]). Any element f gHq(0, 1) can he identified 
with a unique function f g 't^q [0, 1], the space of continuous functions on [0, 1] with 
f(0),f(l) = 0. Conversely, a function f e '^q[0, 1] corresponds to an element ofH^iO, 1) 
if and only if 

\Jifix)g"ix)dx\ 
sup (BZ) 



is finite, and then there is a unique f g Hq(0, 1) associated to f, whose norm is (BZ . 



In accordance with Lemma 



2.1.2 



we will identify functions of "^qIO, 1] with ele- 
ments of Hq(0, 1) whenever it is possible. If /" g '^q[0, 1] does not correspond to an 
element of HqCO, 1), then we will set ll/ll^i = +oo. 

Now we can state the 

2.1.3 Theorem (Weak event sufficient condition). Let ^ and he two o-algehras such 
that, for all AeJ^ and B e'^ with respective prohahilities p and q, 

P[AnB]-pq^ap)eiqf^ (CA) 

for some (,ee %°[0, 1]. Then: 

{^:^^}^||Cll^i||0||j^i. (CB) 

Proof. We begin with the following formula for covariance: 

2.1.4 Lemma. For f and g two real functions, 

Covif,g)= [ {I'[f^xandg^y]--P[f^xmg^y])dxdy. (CC) 

JKxIR 

Proof of Lemma 2.1.4\ Suppose in a first time that f and g are nonnegative. A classical 
Fubini argument (see [4, Problem 21.6]) shows that 

£[/■]= r P[/">x]dx, (CD) 

with a similar formula for g. By the same method, 

E[/"^]= / V[f>xan^ig>y^dxdy, (CE) 

SO that, using the computational formula Cov{f,g) = E[fg]-'E[f]'E[g], 

Covif,g)= [ inf >x and g>y]--p[f> xmg>y])dxdy. (CF) 

Observing that the integrand is also iP[f ^ x and ^ ^ y] - P[f ^ x]P[^ ^ y]) and that it 
is zero for ix,y) tU+ x U+, we get (CCI in the nonnegative case. By translation invari- 
ance, the formula remains true for all f,g bounded below, and then by approximation 
for all A^gL^. □ 



'^*^Note that there is no need to put absolute values in the left-hand side of ( CA i. 
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Now, let f and g be variables resp. ^- and "J^-mesurable, and denote by F and G 
the respective distribution functions of f and g. Up to a slight perturbation, F and G 
may be supposed to be diffeomorphisms from IR onto (0, 1); denote by a and fi their 
respective inverse maps. Then a change of variables in ( |CC[ ) yields: 

Cov(/-,^)= [ (P[/-^a(p)and^^i6(g)]-pg)a'(p)i6'(g)dpdg, (CG) 
J(o,i)2 

so by assumption ( [CA| ): 

Cov(/-,^)^( [V(p)a'(p)rfp)x( [^0(g)^'(g)dg). (CH) 
Jo Jo 

Then our theorem becomes equivalent to the claim stated and proved just below. □ 

2.1.5 Claim. If f is a random variable whose repartition function F is a diffeomor- 
phism of inverse a, then for ( e '^q[0, 1]: 

>i 

ap)a'ip)dp\^U\\HiSd{f). (CI) 



1/ 

Jo 



Proof. First note that, replacing ghy f in ( [CG| ), one has: 

Var(/-)= [ [pil-q)Aqil-p)]a'ip)a'iq)dpdq. (CJ) 

J(0,l)2 

In fact, one can define a scalar product '^^^ {',')v on ^°(0, 1) by setting 

{(p,y/)v= I \j){l-q)Aq{l-p)\(p{p)f{q)dpdq, (CK) 

J(0,l)2 

so that if a is the inverse distribution fuction of a variable f , Var(/") = ||a'||^. 

So, we are considering three scalar products on some subspaces of ^"^(0, 1): the 
ordinary product, which we denote by (•,'>^2, the Hq{Q,1) product ('■>')hI 
variance product {•, •>y. Our goal is to show that for all (peH^iQ, l),y/ € "tf^CO, 1), 

\{(p,y/)L2\^\\(p\\Hi\\y^\\v- (CL) 

By approximation we can suppose that cp g 'io^iO, 1). A direct computation shows 
that 

{(p,'i(/)v = {L(p,f)L2, (CM) 
where the operator L : "^^(0, 1) ^^^(0, 1) is defined by: 

{L(p)ix)-xl (1- y)(piy)dy + il-x) I y(piy)dy. (CN) 

Jo Jx 

But we can make L appear thanks to the following formula: for cp e (0, 1), 

(p = Li-(p"), (CO) 
as one checks by integrating by parts twice. So, 

\{(p,y/)L^\ = \{Li-(p"),y/)L2\ = \{-(p",yf)v\ \\(p"\\v\mv, (CP) 

where 

\\(p"\\l = {(p",cp")v = (Lcp",<p")L2 = -{(P,<p")l^ = Wcp'Wl^ = mil, (CQ) 
whence □ 
^'''^The positivity of (•, •>y follows from the identity {(p,(p)v - fp<qifp (p('')dr)^ dpdq. 
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Figure 1: The function A. 

2.2 Strong event sufficient condition 



2.2.a The strong event sufficient condition 



\J p(l-p), since that would give a converse to Formula (AS I of Proposition 1.1.18| Un 



fortunately \\\/pil- p)\\jji = +oo, so Theorem 
is however a specific result then: 



2.1.3 



A natural choice for functions ( and 6 in Theorem |2. 1.3| w ould be (ip) = Q{p) = e ^^^ x 



does not work in this case. There 



2.2.1 Theorem (Strong event sufficient condition). Let ^ and ^ be two a-algebras 
such that, for all A and B resp. in 3^ and with respective probabilities p and q, 



P[A nfi] -pq^ E^pil - p)q{l - q) 



for some e £ [0, 1]. Then 



where A : [0, 1] U+ is defined by 



Aie). 



£(l + |log£:|) ifE>0, 
ife^O. 



(CR) 



(CS) 



(CT) 



2.2.2 Remark. The function A is increasing on [0,1] and satisfies A(0) = 0, A(l) = 1, 
and A(e) > e for all e g (0, 1). Moreover it is continuous, in particular A(e) \ as e \ 
(see Figure [l]). 



2.2.3 Remark. I called Theorems 2.1.3 and 2.2.1 resp. "weak" and "strong" event suf- 



ficient conditions; yet that vocabulary is a bit misleading, since the strong condition 
does not imply the weak one stricto sensu: with the hypotheses of Theorem |2. 1.3| in- 



deed, Theorem 2.2.1 only implies that 



{^:<;^}^A(||C||^i||0||jji) 



(CU) 



But the right-hand side of ( [CU| ) tends to as soon the right-hand side of (jCBj) does, so it 
is relevant to say that Theorem 2.2.1 is 'qualitatively stronger' than Theorem [2.1.3 



2.2.4 Remark. With the same informal vocabulary. Theorem 2.2.1 is a 'qualitative 



converse' of Proposition 1.1.18 maximal decorrelation is 'qualitatively equivalent' to 



decorrelation of events as defined by Formula (AS I. 
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2.2.5 Remark. One can prove that the bound A(e) in ( CS I is the best possible: see § 



2.2.C 



Proof. The core principle of the proof is the same as for Theorem |2.1.3[ except that we 



first perform a tricky refinement of the hypothesis: observing that, for A and B with 



respective probabilities p and q, one trivially has P[A r\B]^ p Aq, the bound ( |CR| > can 
be strengthened into: 



P[AnB]^ipq + E\/pil-p)qil-q))ApAq. 



(CV) 



The right-hand side of (CVl will be denoted by Ze(p,g). 



Now, like in the proof of Theorem 2.1.3 if (CRI is satisfied, for f and g two L 



real variables resp. and "Jf -measurable, having respective distribution functions F 
and G with respective inverses maps a and jS: 



Covif,g)^ [ iZeip,q)-pq)a'ip)p'iq)dpdq. 

J(0,l)2 



(CW) 



Call {a',p')z, the right-hand side of (CWl 



To bound {a', I5')zs, this time we are remaining on a random variable paradigm: 
2.2.6 Definition. The Chogosov lau. is the (unique) probability measure /i on (0, 1)^ 



such that 



V(p,g) G (0, if m[(0,p) X (0,g)] = Z,(p,g). 



(CX) 



It shall be proved in § 2.2.b that the Chogosov law actually exists, jx is invariant 



under switching xi and X2 as the function Z{p,q) is, and its marginals are uniform 
on (0,1) as Z{p,l) = p. 

The Chogosov law is linked to (•, •)Zj by the operator defined next: 

2.2.7 Definition. For p g (0, 1), denote by jUp the conditional law of xa knowing that 
xi= p under jj,: {Hp)pe(Q,i) is the family of probability laws on (0,1) such that for all 
measurable X c (0, Vf, 



ju[X]= ^ipViq: {p,q)eX}]dp 
Jo 



[§] 



(CY) 



2.2.8 Definition. Let if be the operator on bounded functions on (0, 1) defined by 

{S£P){p):= C piq)diJ,p[q]; (CZ) 
Jo 

in other words, £^ is the generator of the stationary Markow chain ... ^ rg ^ ri—>- ... 
with uniform equilibrium measure on (0, 1) such that the (rj,ri+i) have law ju. 

'^*^So called in honour of my dear friend M. K. Chogosov. 

^^^Stricto sensu the family of measures (/ip)pe(o,i) is only defined up to a.s. equality; however, this 
family has a unique version such that p ^ fip is continuous for the weak convergence of measures, 
which is the one that we will consider in the sequel. 



32 



Then the very definition of (•, ')z^ yields: 



<a',i6'>z, =Cov(a,if;6), (DA) 

where by writing "Cov(a, S£py' I consider functions a and S^^as real random variables 
on the probability space (0, 1) endowed with the uniform measure. 

By the Cauchy-Schwarz inequality, it is then enough to prove that Yar(££'P) ^ 
A(£:)Var(j6), i.e. that the operator norm of if on L^(0, 1) is bounded above by A(£:). 



That work is achieved by Lemma 2.2.13 in next subsection. □ 



2.2.b The Chogosov law 

This subsection deals with the "Chogosov law", which we introduced in the proof of 
Theorem iXH 

Throughout this subsection we suppose e g (0, 1) fixed and we write A for Aie), 
resp. Z for Z^. The drawings will be made for e = 1/2. 



Recall Definition 2.2.6| of the Chogosov law. We first have to check that the Chogo- 



sov law actually exists: 

2.2.9 Claim. There exists a (unique) probability measure y. on (0, 1)^ such that 

V/3,ge[0,l] ij\_{ixi,X2)^{Q,lf: xi ^ p and X2 ^ q}]^ Zip,q), (DB) 
where we recall that 

Z{p,q)~{pq + E\/p{l-p)q{l-q))hp hq. (DC) 



Proof. (iDBl) means that the density of y, on (0, 1)^ is equal to the distribution Z; the 



non-trivia 



XIX2 

point consists in proving that that distribution is nonnegative. 



From now on in this subsection elements of (0, 1)^ will be automatically denoted 
by ip,q). Moreover, we will denote p ■■= 1- p and p ■= p - 1/2, resp. q ■= 1- q and 
q:^q-V% 

The analytic formula defining Z(p,q) depends on the zone of (0,1)^ in which (p,q) 
lies (see Figure [2]l: 

• If pq qp ^ e^, then Z(p, q) = p and we will say that we are in zone ®; 

• If ^ pq qp ^ e~^, then Z(p, q) = pq + \/ppqq and we will say that we are in 
zone (2); 

• If ^ pq ^ qp, then Z{p,q) = q and we will say that we are in zone (3). 

So the expression of dpZ depends on the zone where one lies: in ® it is "1", in (D it 
is "q - £p\/qq pp", and in (3) it is "0". Anyway it is defined and finite everer5rwhere, 
just having jumps at the borders between the zones, which borders we will denote 
respectively il for the border between ® and (2), and D for the border between (2) 
and (3) (see Figure [2j). To prove that the distribution d'^^Z is nonnegative, we have to 
show that dpZ is increasing in g at p fixed. Let us check it: 

• In ® and (3), dpZ is differentiable with dq{dpZ) = 0^0; 
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Figure 2: The Chogosov law fj.. On the left are drawn the different zones relative to the support 
of the measure; on the right is a cloud of 2,048 independent points with law //. 



In (D, dpZ is differentiable with dqidpZ) = 1 + epq \/ppqq. Denoting by p{p,q) 
that expression, let us prove that pip, q) is nonnegative (and even positive) in (2): 
either p and q have the same sign and then p(p,q) is trivially ^ 1, or p and q 
have opposite signs. In the latter case, say for instance that ip ^0 and q ^ 0). 
Then p ^ 1/2 and q ^ 1/2, so \p\- p- 1/2 < p and \q\ = 1/2- q < q, which imphes 
that 

\PQ\ ^ [P^ ^ ,/ -2 1 mn^ 
< £i/ ^ ^ ev e =1, (DD) 




\/ppqq V 
so that p(p,q) > 0. 

On 2), 5pZ makes a jump. Denote by the unique g such that ip,q) g S). 
When q tends to by lower values, ip,q) is in (3), so dpZip,q^-)^0, while 
when g tends to g^ by upper values, ip,q) is in (2), so dpZip,q^+) - qp - 



V Q'p PP- But on 2), qp = e pq, so 



SO that the jump oi dpZip, •) at g® occurs in the increasing sense. 
• Similarly we find that on il, with obvious notation, dpZip,q^+) - dpZ{p,q^-) = 
q^^2p>0. 

So we have proved that dpZ(p,q) is increasing in g, which is what we wanted. □ 

2.2.10 Remark. The measure /i has a rather complicated structure: it is supported by 
zone (2); it has density 1 + epq \J ppqq w.r.t. the Lebesgue measure in the interior of 
that zone, and on its boundaries it has a linear density giving a mass {q^2p)dp to the 
infinitesimal part of of abscissa p, resp. a mass (g 2p)dp to the infinitesimal part 
of il of abscissa p. See Figure [2] 

Now that its existence is ensured, we notice a crucial property of the operator 5£: 

2.2.11 Proposition, if is self-adjoint on L^(0, 1). 
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2.2.12 Remark. As S£l = 1, we can also consider 5£ as an operator on the quotient 
space L^(0, 1), on which it shall also be self-adjoint. 



Proof of Proposition ^.2.1 1\ Indeed {a,^^) = E^[a(p))6(g)], which is invariant under 
switching a and j6 as )U is invariant under switching p and q. □ 

Now we can turn to the main result of this subsection: 

2.2.13 Lemma. The operator norm of ^ on L^(0, 1) is hounded above by A. 

Proof. Let r] e (0, 1/2), devised to tend to 0, and define the distance dj^ on (0, 1) by: 

ypi<P2 dr^ipi,P2)--= ippr^'^^'^dp. (DF) 

J Pi 

For continuous f : (0, 1) U, define 

\fiP2)-fipi)\ 

\\f\\Lip{i^)--= sup — — — , (DG) 

and denote by Lipirj) the set of functions f with WfWupiri) < oo. Lipirj) is obviously 
complete for || • Wupiri), yet that semi-norm is not definite since it is zero for any constant 
function. We thus define Lipir]) as Lipirj)/U, which is actually a Banach space. I claim 
that 

2.2.14 Claim. Lipir]) is continuously imbedded in L^(0, 1), i.e. there exists some C <oo 
(depending on rj) such that for all f € Lipirj), Sdif) ^ CWfWupir])- 



Proof of Claim ^2. 14\ Fix some arbitrary po e (0,1). For f e Lipirj), denoting yo ■= 
fipo), one has, for all p g (0, 1), 



\fip)-yo\ ^ \\f\\Lipi^)\ r(gg)-^^2+Vg 



(DH) 



whence: 



Sdif)^^j\fip)-yo\^dp ^ \\f\\Lipir,M\\\qq)-^'''^^dqfdp. (DI) 



Since r/ > 0, the integral in the right-hand side of (DI I is finite, which proves the claim. 

□ 

Now, the cruxpoint is the following claim, whose proof is postponed: 
2.2.15 Claim, (i) There exists a constant A^ < cxa such that for all f g Lipirj), 

\\^f\\Lip(r^)^^r^\\f\\Lip(r^)- 

(ii) It is possible to choose A^ so that lim ^^ ^ A^ ^ A. 



Using Claims 



2.2.14 



and 



2.2.15 



one has for all f € Lipirj), for neN, 



Sd(if "/■) ^ CW^^'fllLipirj) ^ CA';^\\f\\Lipi,) " 0(Ap. 



(DJ) 



By Lemma 



0.3.1 



since if is self-adjoint on L^(0, 1) and Lipirj) is a dense subset 



of L^(0, 1), (D J I implies that |||i?||lz;2(o,i) ^ A^. Making 77 \ 0, |||i?||lL2(o,i) ^ A, QED. 



□ 
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Figure 3: The function Q{p,w). This drawing plots the functions Q{',(j)) for values of a» running 
from to 1 with step 0.02. Note that all these functions are defined on the whole (0, 1): in fact 
the graph of Q 'merges' with X) beyond a certain point for a» < 1/2, resp. it merges with 11 below 
a certain point for cd > 1/2. For cd < e^I2, resp. o) > 1 - e^/2 (which corresponds here to cti < 0.125, 
resp. tx) > 0.875), the whole graph of Q(p,') is actually equal to the curve D, resp. il. 



Proof of Claim 2.2.15 The proof relies on monotone rearrangement of measures 
(cf. Dm p. 75]). For p e (0, 1), co e [0, 1], define 



Q(/3,a»):=inf{gG(0,l): dpZip,q)^a)} 



(DK) 



(see Figure |3]), so that Q{p,(i)) is nondecreasing in a» and that, for w with uniform law 
on [0, 1], the law ofQ{p,ii)) is the conditioned version y.p of the Chogosov law, therefore 
giving: 

{^f){p) = Y.[f{Q{p,oj))] 



Then one has the following 'coupling formula': 

{^f){p2) - i^f)ipi) = E [fiQip2,0j)) - f(QipiM)l 



From ( |DM| ) we deduce that 

\{^f)ip2)-{^f)ipi)\ ^ \\f\\LipinMdrj{Qipi,C0),QiP2,(^m. 

So, if we can prove that for a\\ pi < p2, 

E[dTjiQipi,co),Q(p2,(o))] ^ Kr)dr^{pi,P2), 

then we are done. 



(DL) 



(DM) 



(DN) 



(DO) 



Now I claim (it will be checked later) that Q is absolutely continuous w.r.t. p, i.e. 
that there exists an integrable function Q' : (0, 1) x [0, 1] ^ IR such that for all ii),pi,P2 
one has Q{p2,oi) = Qipi,(D) + f^^ Q'(p,a))dp. Introducing that function, (DDI becomes: 



E 



[| {QipMQipMY^'^^'^Q'ipMdpW^A^^il ipp)-^'^^"^ dp\ (DP) 
Jpi Jpi 



'^'"^Beware that here the expectation is not taken w.r.t. p but w.r.t. en. 
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so by Fubini's theorem (which is legal here since, as we will see later, Q' is bounded). 



proving (jDPj) for all < P2 is tantamount to proving that, for all p £ (0, 1), 



(DQ) 



So we have to compute Q'(p,a)). Using the structure of the law )U (cf Re- 
mark 2.2.101, we find the following (see Figure [3]): 

• First if CO < q^^2p, then Qip,co) = , whence Q'{p,Oi)) = dq^/dp. Differentiating 
the equality qp- e^pq defining 2), one finds that dq^/dp - (q^ +E^qp)^ip+E^p), 
which simplifies into q^q^ ^ PP using once again that qp = e^pq. 

• Similarly if to > 1 - 2p, one has Q'ip,co) = q'^q^ pp. 

• If q^ ^2p <a)<l- qp^2p, then dpZ{p,q) = q - ep\/qqlpp, thus differentiating 
the equality dpZ{p,Q{p,(i))) = co, we get: 



Q'ip,io) 



ey Qip,co)Qip,a)) 



^yPP 1+e 



(DR) 



/ ppQ(p,a>)Q(p,(i))' 

• Finally in the critical cases oj = q^ ^2p,l-q^^2p, there is no canonical value 
for Q'icj) since at these points Q(',a») is not but that does not matter. 
2.2.16 Remark. Note that one always has Q'ip,w) > 0, i.e. Q(',a)) is increasing. In other 
words, for pi< p2, Hp^ is stochastically smaller than Hp^. 



We have computed Q'ip,a)), so now we can tackle (DQl: we have to bound 

• 1 



/„(, 



PP 



Jo ^Qip,a))Qip,w) 
uniformly in p. We begin with noticing that 



.3/2-77 
1 Q'ip,aj)da), 



(DS) 



2.2.17 Claim. For all p e (0, 1), all q £ iq^,q^\ one has qqipp ^ £ 



Proof of Claim [272. 1 7\ The condition q g means that ^ pqiqp ^ e ^. Then 

we distinguish two cases: 

• VLp^q, then qqIpp ^{qlpfqplpq ^ qplpq = {pqlqpT^ ^ e"^; 



If p^ q, then qqIpp = iq/p) pq/qp ^ pqIqp ^ e" 



□ 



Thanks to Claim |2.2.f7| we bound (|DS]l by 

PP 



-2rj 



Q{p,(D)Q{p,(D) 



.3/2 

1 Q'ip,a))da), 



(DT) 



which we shorthand into e ^^Mp). Splitting the integral in (DTI according to the value 
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of w (resp. for w e i0,qp2p), iqp2p, 1 - q^/2p) and (1 - q^/2p, 1)), one finds: 



^P^QpQ^' PP 



P 



+ 



PP 



3/2 



eyQ(p,(jt))Q(p,a)) 



rl-q^/2p 

Jqf/2p ^Qip,co)Qip,co)^ i.^^il I r ?Q^P''^^ 

• ' ppQ{p,w)Q(p,cD) 



-dcj 



(DU) 



(DV) 



(DW) 



(DUI simplifies into \Jq^plpq^ = \^ = d2\ similarly (DWI = \J pq^lq^p = e/2. 



Concerning term (DVI, we make the change of variables q =~Qip,w), for which dco = 
(1 + Epql\/ppqq)dq because of the expression of the density of y. in zone (2) (cf Re- 



mark 2.2.10I. One gets 



(DVI 



£ 1 er g 

- / ^ —dq = - log- 



ii 

9? 



efi P 



e p \ £ ± 

log = -log^ = £:|loge| 

p I 4 



e'^p 



(DX) 



So in the end we have Xip) = e/2 + e/2 + e| loge| = A for all p, thus ^ e '^^A (hence 
which tends to A as 77 \ (hence □ 

2.2.18 Remark. The simplifications in the computation of Xip) look rather miracu- 



A). That I found 



lous. . . A priori I only expected that Xip) ^ A on (0, 1) with X(p) 
the exact quasi-eigenvector associated to the quasi-eigenvalue A (cf Remark [2.2.19[ > is 
purely fortuitous; I have no simple explanation for why things work so well. 

2.2.19 Remark. ^ is self-adjoint, hence normal, so its operator norm is also its spec- 
tral radius. Therefore there is some {eigenvalue, eigenvector) pair, or more precisely 
(since here the spectral radius of 5£ is due to its continuous spectrum) some 'quasi- 
eigenvalue' and its 'quasi-eigenvector' (cf [[571 § 4]), which are responsible for the value 
of the operator norm. 



Tracking this quasi-eigenvector throughout the proof of Lemma 2.2.13 we find that 



A is a quasi-eigenvalue of 5£ and that the associated quasi-eigenvector is: 



fK-. P- \\p'p'y^'^dp'. 
J 1/2 



(DY) 



Obviously fx is not in L , so it is not a true eigenvector; however one can perturb 
it slightly to get an element /a g L^(0, 1)-- {0} such that {■^fA,fA)LHo,i)/\\MzHoi) 
arbitrarily close to A. 

2.2.20 Remark. An interesting feature of f\ is that its 'L^ mass' is concentrated about 
and 1, so that one needs only look at what happens near and 1 to understand how /"a 
contributes to the operator norm of if. 

When one 'zooms' more and more to the point (0,0) — the same behaviour would 
happen about (1, 1) — , ju 'looks more and more like' the measure ju* on (0,oo)^ defined 
by (see Figure |4]): 



Vp,gG[0,oo)^ /j*[{(xi,X2) G (0,00)^ : xi ^ p and X2 ^ q}] = E\/pq ^p ^q, 



(DZ) 
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Op Op 



Figure 4: The measure ^* . On the left, the different zones for the measure; on the right, a 
Poisson cloud of points with density fi* . The scale and density of the cloud are consistent with 
Figure |2] 

i.e. 

£ £2 1 

So, near 0, ^ behaves like the operator if* on L^(0,cx3) defined by: 

(Se*f)ip)^r " -^dq+'^f{e''p)+\f{8-''p). (EB) 

iif * has scale invariance properties which make it easy to study. One finds that ^* 
is self-adjoint, that its spectral radius is A, and that it has A as a quasi-eigenvalue, 
associated with the quasi-eigenvector {p l/y^). So, you see that it suffices to study 
the 'local' operator if* to compute the spectral radius of the 'global' operator if; in 
other words, there is a phenomenon of 'localization of the spectral radius' for if. 



2.2.C Optimality of the strong event sufficient condition 



Now I will prove that Theorem 2.2.1 is optimal 



2.2.21 Theorem. The factor A{e) in ( fC^ cannot he improved. In other words, for 
all A' < A(£) it is possible to find a-fields ^ and satisfying 

yAe^,Be(S P[A nfi] -P[A]P[S] ^ £v/P[A]P[A=]P[5]P[5^], (EC) 

but such that {3P :'S}-» A'. 

2.2.22 Remark. One can automatically add absolutes values in the left-hand side of 



the condition (EC I, since -(P[A n5] -P[A]P[fi]) = P[A nfi'^] -P[A]P[5^]. 



Actually I will rather prove the following statement, which is equivalent to the 
theorem by continuity of the function A('): 
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Figure 5: A schematic representation of the measure v. 



2.2.23 Claim. For all e' > e it is possible to find a-fields 3^ and ^ satisfying [3 ■.'^}^ 
A(e), hut such that 



VAe^,fiG^ P[A n 5] - P[A] P[fi] ^ e' v^PLA] PLA^] P[E] P[fic]. 



(ED) 



Proof. According to the proof of Theorem 2.2.1 the 'natural' proof would be to take 
for space (0,^,P) the set (0, 1)^ equipped with its Borel cr-field and endowed with the 
Chogosov law y., and to set 3 = a(p) and - aiq). Though it seems to be true that 



that system satisfies (jCRj), the complicated structure of ju makes existence of a short 
proof for that property unlikely. Therefore I will rather adapt the previous idea to the 



nicer measure /i* defined by (DZ I, or more precisely to a 'truncation' of it. 



My system is the following: (0,^,P) is the set (0,1)^ equipped with its Borel cr- 
field and endowed with a certain measure v (specified just after), and I take 3 = uip), 
resp. = aiq). The measure v, which depends on some parameter x g (0, 1) morally 
close to 0, is a measure on (0, 1)^ having uniform marginals, which coincides with jU* 
on iO,x]^ and which is 'as uniform as possible' outside (0,x]^ (see Figurejs]). Technically: 



ju*(Ax5) if Ac(0,x] and-Bc(0,x]; 

if Ac(0,c^x] andfic(x,l) 

if Ac(x,l)andSc(0,e2x] 

[Xi(l-|y|)(ip]|5|/(l-x) if Ac (e2x,x] and5c(x,l) 



v[A xB] = ^ 



\A\[JBil-^^^)dqyil-x) if A c (x, 1) and 5 c (e^x, x] 
[1 - (2 - e)x]|A||B|/(l - x)2 if A c (x, 1) and B c (x, 1). 



(EE) 



First step: Proof that [3 : ^ A. Let A' < A. Since A is in the spectrum of the self- 
adjoint operator if* on L^(0,oo) (see (EBl and the lines just below), there exists f e 
L^(0,oo) {0} such that (-.S?* /",/")/!!/" 11^,2(0 oo) > By a standard truncation argument. 



we can assume that f has bounded support, say that f is zero outside (0,1^]. Dividing 
f by its norm we can also assume that II/" 11^2 - 1. 
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Now, for y £ (0,x] define the function fy by: 

fyip)--=\l-f\-pV (EF) 



fy is zero outside (0,y]c (0,x]; it satisfies ||/"y|li2 = 1 and 

{5e*fy,fy)^{5e*f,f)>M (EG) 

by the scale invariance properties of if*. 

Denote m ■■= f fip)dp/VY, which is finite since f is with compact support; one 
has / fy(p)dp = s/ym, so the projection of fy on L^(0, 1) is the function fy-fy- s/ym. 
One has ll/Vllz;2(o,i) ^ II/VIIl2 = 1, and 

E [/;(p)/;(g)] = {Se*fy, fy) - m^y > A' - m'^y, (EH) 
so that :^}> A' -m'^y. Making y^O and then A' A, one finally gets :^}^ A. 



Second step: Proof of (ED\ Let e' > e; we want to prove that, provided x is small 



enough, (EDI is satisfied. 



Let A and B be resp. and "J^-measurable events. One can assume safely that 



\A \ ^ 1/2, since replacing simultaneously A by A*^ and B hy B^ leaves both sides of (EDI 
unchanged. One can also assume that \B\ < 1/(1 + £^), since for \B\ ^ 1/(1 + e^), (EDI 
comes 'for nothing' by writing 



P[A nfi] -P[A]P[B] ^ P[A](1 -P[5]) ^ VP[A]P[Ac] x £\/'P[B]'P[B^]. (EI) 

/re ^/le sequel of this proof we indentify A and B with Borel subsets of(0, 1), rewrit- 
ing the p-measurahle event A into the set A x (0, 1), resp. the q-measurahle event B into 
the set (0, l)x5. Since both marginals ofv are uniform on (0, 1), one then has P[A] = |A|, 
resp. P[B] = \B\, so that our goal becomes proving: 



v[AxB]-\A\\B\^e'^\A\\B\\A^\\B^\- (EJ) 

Denote A := A n (0,x], resp. B := S n (0,x]. Provided x ^ e/2, the signed mea- 
sure dv{p,q)- dpdq is nonpositive on (0,x] x (x,l)u(x,l) x (0,x], so that 

v{AxB]-\A\\B\^v{AxB'\-\A\\B\ + v{{A^A)x{B^B)'\-\A^A\\B^B\. (EK) 



Now let us bound above the right-hand side of ( [EK| l: 

• The second term is obviously nonpositive. 

• The third term is [ 1 - (2 - £:)x] | A - A | |B - K ( 1 - x)^ , so the sum of the two last 
terms is {ex-x^)\A^A\\B^B\^{1-x)'^ ^ (ex - x2)|A||fi|/(l - x)^ Since |A| ^ 1/2 
and IS I ^ 1/(1 + e^), that quantity is in turn bounded by x v/|A||5||Ac||5=|. 



• For the first term, by Lemma 2.2.24 stated just below, one has v[A x fi] = 



^* [A X 5] ^ e\/\A\\B\ ^ eV'lIllA^llBllB^I -^(1 - x), in which, provided x ^ £^ (1 + e^), 
one has \/\ b\\b^\ ^ \/B\B^ (because then \B\ ^ |5|ax and \B\ ^ 1 - x), and 
similarly v|A||A^ ^ \/A|A^|, so that in the end the first term is bounded by 
^^sJ\A\\B\\A^\\B% 
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Summing things up, we get: 

2 

v[A X S] - |A||S| ^ (-^ + ]y/\A\\B\\A-\\B^\. (EL) 



Taking x sufficiently close to 0, the first factor of the right-hand side of (ELI is ^ e', 



whence the second step of the proof □ 

2.2.24 Lemma. For all A, Be (0,oo)^ with Lebesgue measures |A|, |5|< oo, /i*[A x 5] ^ 
eV\A\\B\. 



Proof of Lemma 2.2.24 Recall that /i* is the Radon measure on (0,oo)^ having density 
elA^sJpq wr.t. the Lebesgue measure inside the cone C = {{p,q) : e^p <q < £~^p}, being 
zero outside C, and giving to the borders of C a lineic mass defined by iJ*{{p,E^p): 
p eA} = E^\A\I2, resp. jj* {{p,E~'^p): p e A} = \A\I2 (see Figure |4]l. y* is invariant un- 
der switching p and q, and its marginals both are the Lebesgue measure on (0,oo). 
Let A,B c (0,oo) be Borel; our goal is to show that ju*[A x 5] ^ £\/\A\\B\. 

Step 1. If |A| ^ £^\B\ the result is trivially true, since then y*[A xB]^ iJ,*[Ax 
(0,oo)] = |A| ^ Cv^lAIIEI. Similarly the result is true if |5| ^ e^|A|. Therefore in our 
proof we will always assume that£:2|A|<|fi|<e-2|A|. 

Step 2. As for the measure jj., decompose the support of ju* into three parts il, (2) 
and D, corresponding resp. to the line "p = e^q", the cone C and the line "p = e~^q" 
(see Figure|4]l. Write fJ.*[A xB] = mu + m2 + mo, where mu = iU*[(A x 5) nil], etc.. 

Denote by //* the 'conditioned version' of fi knowing q, i.e. the probability measure 
such that 

M*[X] = j^ ^i;[{p: ip,q)eX}]dq, (EM) 
which can be computed explicitly to be: 

el 

d^lip] = % = ,2^} y + 1{,2^ < p < ,-2^j ^dp + 1^^ ^ -. (EN) 



The three terms of the right-hand side of (EN I are respectively due to il, (2) and 'D, so 
that, integrating the first one, one finds: 



e2 



Switching the roles of p and q, one has similarly mo ^ £^|A|/2. Then it only remains 
to bound m2- 

Step 3. Let us study further the measures If g g e^A, then A 3 e~^q and thus 
;u*[A] ^ )U*[{£"^g}] = 1/2, and conversely q € e^A, then A ^ £"^g and thus /x*[A] ^ 
l-Ai*[{£-^g}] = l/2. So, ;u;[A] is never smaller if g g e^A than if g f e^A. 

As a consequence, let us show that we can always assume that e^A c B. Since |S| > 
e^lAI, IS-e^AI > \e^A^B\, so we can fix some fi" cfi -^e^^ such that = \e^A^B\. 
One has: 

/i*[Ax5-]=r ^*(A)(ig ^ ^ = ^ f |u!(A)(ig = Ai*[A x(e2A-B)]. (EP) 
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Shorthanding "iB-~B-)ue^A" into "B"', (|ep|| implies that replacing B by B' —which 



does not modify the value of \B\ — cannot make /i*[A x B] decrease. Consequently, if 
we prove that /i*[A x B'] ^ e\/\A\\B'\, then we will also have proved that /i*[A xB] ^ 
e\/\A\\B\. As e^A cfi', we thus have demonstrated the statement at the beginning of 
this paragraph: one can always assume that e^A (zB. 

Switching the roles of p and q, we will rather impose, instead of e^A c B, that 
e^BczA. 

Step 4. Call /i° the measure /i* restricted to C, i.e. = l(jdii* , so that m2 - 
)U°[A X B]. fj.° is absolutely continuous w.r.t. the Lebesgue measure; denote by /i° its 
'conditioned version' for fixed g, i.e. the measure such that 

IJ°[X] = ^;[{p: ip,q)eX}]dq, (EQ) 

which has the following explicit density w.r.t. the Lebesgue measure: 

d^ll^p] = i{,2^ < ^ < e-'q}X^^P- ^^^^ 

We perform a change of variables: for y e (0, \B\), define 

I3iy) = mf{qei0,oo): \B niO,q)\ ^ y}; (ES) 

so that the push-forward I5#dq of the Lebesgue measure on (0, by the map p is 
equal to Isdq, the Lebesgue measure restricted to B; then 

^® = j^f^l^^^dq = ^°(y)[A](iy. (ET) 

Our strategy will consist in bounding /i^^^j[A] for all y. 

First, we observe that there is some portion of A which does not contribute 
to ifp^^y^lA]. Denote indeed Ay ■■- {e^q : q e B r\iO,piy))}; by the definition of p, 
\Ay \ = E^y, and one has Ay c e^B c A. But Ay c {Q,e'^ (i{y)), so if^^^^-^VAy] ~ 0, and thus 
/i°(^)[A] = //°(^)[A-A3,], where |A - A^l = |A| - lA^,! = |A| -t^y. 

Now, for q G (0,oo), the density of /i° is zero for p ^ e^g and it is nonincreasing for 
p > E^q, so an immediate coupling argument shows that the maximal value of iU° [X] 
under the constraint = x" is attained for X = i£^q,E^q + x). Applying that result to 
the conclusion of the previous paragraph, we get that: 

M°(^)[A] ^ E^Piy) + \A\ - E^y)]. (EU) 

But for X ^ 0, the quantity /i° [(e^g , E^q + x)] can be computed explicitly to be 

°rr2 2^x1 f il- e^)/2 ifg^x/(e-2-£2); 
^'^^^^ ^'^ ^ + ^^^ = 1 (.v/i^-.^)/2 ifq>x/iE-'-E\ ^^^^ 

In particular, that quantity is a nonincreasing function of q. Since, by the definition 



of /3, one always has piy) ^ y, it follows that (EUl can be improved into: 
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Integrating, one finds finally: 

"1^1 (E^/\A\ E^- 

m2 ^ ■ ' ■ 



(l-£2)e2|^| 



dy 



^ ■E^\A\\B\--{\A\ + \B\). (EX) 

£2|A| 2 



Step 5. We put our bounds together to get the lemma: 

2 2 

//*[A X S] ^ mz) + mc/ + m2 = ^(|A| + + tvlA^BI - |-(|A| + = eV\A\\B\. (EY) 

□ 

2.2.25 Remark. A careful reading of the proof above shows that the maximal value 
of ;U*[A X B] is attained for A = (0, \A\),B = (0, \B\), in which case, provided e^|A| ^ |S| ^ 



e I A I, one has equality in Lemma 2.2.24 
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Chapter 3 



Tensorization 



3.1 Subjective correlation 



In this chapter we will need more advanced definitions for decorrelation. 

3.1.1 Definition. Let X, Y and Z be random variables. For e 3^ 0, one says that X 
and Y are subjectively e-decorrelated w.r.t. Z (or e-decorrelated seen from Z) if X and Y 
are e-decorrelated under the law Law{X,Y\Z = z) for Lau;(Z)-almost-all z ^*^ 

The smallest e such that X and Y are e-decorrelated seen from Z will be called 
the subjective correlation level between X and Y w.r.t. Z (or correlation level between X 
and Y seen from Z); we denote it {X : Y)z- 



In § 1.1 we had given the definitions in terms of a-algebras rather than random 
variables. Of course there is also a cr-algebra definition for subjective correlation, 
though I find it harder to understand: 

3.1.2 Definition. Let 3^ , and J€ be cr-algebras. For e e [0, 1], the expression "{^ : 
^ e" means that for all / e L\g^\je) and all g e L\^\je) satisfying E[f\^] = 0, 
resp. E[g\^] = 0, one has: 

|E[/-^]|^eSd(/-)Sd(^). (EZ) 



We let the reader check that with that definition, for X, Y and Z random variables, 
{X:Y}z = {aiX):aiY)}a^Z). 

3.1.3 Remark. The ordinary correlation can be seen as a particular case of subjective 
correlation, since •.'^} = : }^ for © -{<^,D.} the trivial cr-field. 

3.1.4 Remark. Warning! Writing that {X : Y)z ^ £ does not imply that for all subset C of 



the range of Z, X and Y are e-decorrelated under Lau;(X,Y"|Z g C): see Examples 3.1.8 
and [3T9] below. 

3.1.5 Remark. Warning again! There is no general inequality between {X\Y\ 



and {X:Y\z'- see Examples 3.1.7 and 3.1.8 below. 



f*^The conditional laws Law{-\Z - z) are only defined up to Lai(;(Z)-a.e. equality, whence the need to 
specify "for Lau;(Z)-almost-all z". 
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3.1.6 Example. Let f: U^R+he a nonnegative continuous function with j^f{x)dx = 1 
and let (X,Y,Z) be a variable on with density 

dP[(X,Y,Z) = (x,y,z)] = — /"(2)exp(sinh2 -xy - I coshz • (x^ + y'^)) dxdydz . (FA) 
Then, conditionally to "Z = z", {X,Y) is a Gaussian vector with Var(X) = Var(F) = 



coshz and Cov(X,y) = sinhz, so by Theorem 1.2.6 under the law P['\Z = z] one has 



{X : F} = Itanhzl. Consequently {X : Y\z = sup{|tanh2| : fiz) > 0}. 

The three following examples show that subjective correlation may behave rather 
wildly, especially when one changes the cr -field of reference: 

3.1.7 Example. Let X and Y be independent variables with uniform law on WZ and 
\et Z - X + Y; then {X : Y}z = 1: under P['|Z = z] indeed Y is X-measurable (and not 
constant), since Y = z - X. 

3.1.8 Example. Let a,(i,Y be three independent random variables uniform on {0,1}; 
define X = ir,a),Y = (r,j6) and Z = j. Then, conditionally to "Z = 0", X and Y are 
independent with common law uniform on {(0,0),(0, 1)}, and similarly X and Y are 
independent conditionally to "Z = 1", so {X : Y}z = 0. Yet X and Y are not indepen- 
dent since the events "X e {(0,0),(0, 1)}" and "Y e {(0,0),(0, 1)}", which are non-trivial 
under P, are equivalent, so that {X :Y} = 1. 

3.1.9 Example. Let X = (Xi,X2) and Y - {Yi,Y2) be independent with uniform laws 
on {0, if and define Z = iXi,Yi); then one easily checks that {X : Y}z = 0. Now let Z' = 

- y^, which is Z-measurable; one has {X : Y}z' = 1 since, for instance, under "Z' = 
1" the events "Xi = 0" and "Yi = 0" are non-trivial and equivalent. 

Now we define a more restrictive concept of subjective correlation. 

3.1.10 Definition. A a-metalgehra M is a set {^: J6' e J{} of cr-algebras which is 
stable under the "V" operator, i.e. such that for any M' c Ji, V jeeM' ^ 



One can speak of the 'cr-metalgebra spanned by some set of cr-algebras', as states 
the following immediate proposition: 

3.1.11 Proposition. If {^k)keK is a set of a-algebras, then there is a smallest a- 
metalgebra containing all the J€}^, which is 

Ji^{\l jek;K'^K\. (FB) 



When one deals with random variables rather than cr-algebras, one has the follow- 



ing variant of Proposition 3.1.11 



3.1.12 Proposition. Let {Zk)k£K be a set of random variables, then the a-metalgebra 
spanned by {a(Zk): keK} is {a{Zx'): K' (zK}. 

3.1.13 Definition. Let ^ and be cr-algebras and ^ be a a-metalgebra. We define 
the correlation between ^ and seen from M by: 

{J^ : mji = sup : m^e- (FC) 
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3.1.14 Remark. Speaking in terms of random variables, if X, Y and {Zk)keK are vari- 
ables, denoting by ^ the cr-metalgebra spanned by the Zk, then {X : Y)ji is the supre- 
of the {X : ¥} when taken under all the laws of kind P[-\Zx' = z^i] for K' a 



mun' 



[t] 



subset ofK and Zk,k eK' elements of the respective ranges of the Zk. 

Finally, the following proposition gathers some easy properties of relative correla- 
tion wr.t. a (T-metalgebra: 

3.1.15 Proposition. 

(i) Call Ji(f, the trivial a-metalgebra, that is, Jitf, = {&}; then for all a-algebras ^ 
and^, {^■.m = {^:'S}j^^. 

(ii) If.M'^ . ((', then ^ {.'^ -.'m./f. 

(Hi) Let 3^ and he a-algebras, let Ji hea a-metalgebra, and call J£ the a-metalgebra 
spanned by Ji, ^ and then : = {^:'^} j^. 

3.1.16 Definition. In the sequel, the probabilistic systems which we shall consider 
will often be made of some 'elementary' variables, say {Xi)iei. In this case, the so- 
called natural a-metalgebra of the system will mean the cr-metalgebra spanned by 
the Xi. 



3.2 Simple tensorization 



Now we turn to tensorization. First let us deal with 'simple' tensorization, by which 
I mean that tensorization is performed on only one variable. The main result of this 



section will be the 'A'^ against 1' theorem (Theorem 3.2.21. 



The problem considered is the following: Let / be a set and iXi)iei,Y be random 
variables; call J{ the natural cr-metalgebra of this system, that is, the cr-metalgebra 



spanned by the Xi and Y (cf Definition 3.1.161. Suppose we have bounds {Xi : Y\ji ^ e 



for all i; the question is, can we deduce from them a bound on {Xi : Y}? We shall prove 



that the answer is "yes", and moreover the bound ( FM ) we will give is optimal in some 



way (see § 3.5 1 



For pedagogical purpose, let us first state and prove a weaker but easier proposi- 



tion: 



3.2.1 Proposition. With the notation above. 



iel 



(FD) 



Proof. By Proposition 1.1.12 we may assume I = {1,...,N}. Let f and g be centered 



X-measurable, resp. F-measurable, functions; our goal is to bound |E[/"^]|. 
For all i G {0, . . . ,N}, denote 

:^i = aiXi,...,Xi), 

and for all i e {1,...,A'^}, 

fi = f^^-E[f\J^i-i]. 



(FE) 
(FF) 



'^^^More precisely it is a true supremum (over K') of essential suprema (over zk')- 
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Then f = Y.ifi, where each fi is -measurable and centered w.rt. ^t-i (i.e., 
E\-fi\^i-i\ = 0)- Consequently, for all io < ii one has E[/"ig/"jJ = (since fi^ is ^i^- 
measurable while fi^ is centered w.r.t. => ^Iq) and thus when one expands 

Var(/") = E [(Xj /j) ] all the non-diagonal terms vanish, yielding: 

N 

Var(/-)=^Var(/-i). (FG) 
Now, the decomposition "f = Y.i fi" yields 

N 

E[fg] = Y^E[figl (FH) 

so let us bound the |E[/"j^]|. The law of total expectation gives: 

E[fig] = jE[fig\Xi=xi,...,Xi.i=Xi-i]dP[xi,...,Xi-i]. (FI) 

But under dP[-\xi,...,Xi-i], ft is Xj -measurable and centered while g is F-measura- 
ble, moreover under this law {Xi : Y} ^ {Xj : Y}^^^ ^ {Xi : Y)ji ^ e^, so: 

|E[/"j^|ri = yi,...,Yj-i =yj-i]| ^£:iSd(/"j|xi,...,Xj_i)Sd(^|xi,...,Xj_i). (FJ) 

Using the bound Sd(/i) ^ \jE\h?'\ it follows that: 

\EVfig\\^£i J \jE[ff\xi, . . .,Xi-i\\jE[g'^\xi, . . . ,Xi-i\dP[xi, . . .,Xi-i\ 

Ej ^ J E[/"f |xi, . . . , Xi-i\dPVxi, Xi-i]\/the same for g = et Sd(f) Sd(gi). (FK) 



CS 



So, summing ( FK I for all i : 



N 



mfg]\ ^J^si sdifi)Sdig) ^ IL^^JL^Mfi) sdig) = jY.^1 sdinsdigi (fd 



Since ( FL I is true for all /", ^, ( FD I is proved. □ 



It is striking in Proposition 3.2.1 that the right-hand side of ( |FD| ) may be greater 



than 1, which is never the case for a correlation level. Actually there is some 
'loss of optimality' in the proof of the proposition when we bound above Var(/"j|^j_i) 
by E[/-f |^j_i], since E[/-f |^j_i] - Var(/-d=^i-i) = E[fim-if may be different to 0. We 



will use a technique for 'recycling' that loss to get the following result, which § |3.5| shall 
prove to be optimal: 

3.2.2 Theorem ('A'' against 1' theorem). Take the same hypotheses as in Proposi- 



tion |3.2.i[ \/i e I {Xi : Y]ji ^ ei, where Jl is the natural o-metalgehra of the system. 
Then: 

{Xi:Y}^Jl-Y\a-ep. (FM) 

iel 



3.2.3 Remark. The right-hand side of (FMl is the e g [0,1] characterized by 1 - 
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rem 



3.2.2 gives back Proposition 3.2.1 as a corollary. 



3.2.4 Remark. The right-hand side of (FMl is bounded above by , so Theo 



3.2.1 



Proof. As in the proof of Proposition 
resp. y -measurable, functions. Assume / 
fi ■= f^' - Wfm-il Also denote, for i g {0, . . . , A^}, 



let f and g be centered X-measurable, 
= {1,...,A'"}; denote ■■= a{Xi,...,Xi) and 



g':=g-WgWil 



(FN) 



As before, one has Var(/") = Zi^arifi) and E[/"^] = If^^EL/i^]. But fi is centered 
w.r.t. ^i-i while {g- g'^~^) is =^j_i-measurable, so E[/"j^] = 'E,\_fig''~^]. Since, condition- 
ally to ^i-i, fi and g^~^ are both centered and resp. Xj- and Y -measurable, the fact 
that {Xi : Y)ji ^ implies, by the same argument as in the previous proof, that 

mfid'H ^ eiSd(/-^)Sd(^^-^). (FO) 



Now, for i e {1, . . . , A'^}, denote 

t = n.d-\3^i\ (FP) 

Since g^~^ = g'' +g\ where g'' is -measurable while g^ is centered w.r.t. ^i, one has: 

Var(^^) = Var(^^-i)-Var(^^). (FQ) 

Then, the point consists in making the following observation: for Var(^') to be large 
(that is, close to Var(^'"^)), Var(^') has to be small. But in that case |E[/"j^]| shall be 
small: one has indeed, since fi is -measurable. 



mf,g]\ = mfig'-^]\ ^\E[fiig'-^rq\ = \E[fin\ ^sdifi)sdig'). 

Let us sum up the relations obtained. One has, for all i £ {1, . . . , AT^}: 

mfig]\ ^ EiSdifi)Sdig'-^); 
Sdig') = 



(FR) 



^Sd(^^-i)2-Sd(^02; 

mfig]\ ^ sdifi)sdig'). 

Now define Si ■■= mfig]\ ^ Sdifi)Sdig'-^), or = if the right-hand side is ^ 0. Then 



(FS) 
(FT) 
(FU) 



(FS) ensures that Si ^ ej, and (FUl means that Sd(^') ^ Ei Sd(^^-^), so that (iFTll yields 



Sd(^') ^ y 1 - Sd(^' ^). Since g^ = g, one has therefore by induction Sd(^^) ^ [1 •/ 
Sdig), so that the decomposition "E[fg] = T.iE[fig]" gives: 



N 



i-1 



mfg]\^Y^iii n Ji-ei)sdifi)Sdig). 

i = l i'=l 



(FV) 



By the Cauchy-Schwarz inequality, it follows that 

mfg]\^ 



E^'n(i-^?)sd(/-)sd(^): 

i = l i'=l 



N 



l-n(l-^?)Sd(/")Sd(^). (FW) 

i = l 



Obviously the maximal value for the right-hand side of (FWl is when Si - ei for all i, 
then yielding (iFMb. □ 
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There is an alternative proof, which is less intuitive but whose reasoning shall be 
used again in the proof of Theorem 3.3.1 



Alternative proof of Theorem \3.2.2\ We use the same notation as in the previous proof 
As f is ^-measurable, E[fg] = E[fg^], so by the Cauchy-Schwarz inequality: 

|E[/-^]|^Sd(/-)Sd(^-^). (FX) 



Now, by associativity of variance Sd(^"^) = \/Var(^) - Var(^ - g^), so by (FX I it suffices 
to prove that 

N 

Varig-g^)^Y\il-E^)V8irig). (FY) 

i = l 

With our notation, g - g^ = g^ and g - g'^;we will prove that for all i e {1, . . . , A'^}, 

Yarig')^il-e^)Yarig^-h (FZ) 
Since g''~^ and g'' are centered w.r.t. ^i-i, one has 

Var(^^-^) = jyarig'~^\xi,...,Xi-i)dP[xi,...,Xi-i], (GA) 



with a similar decomposition for Var(^'), so that it suffices to prove (FZ) conditionally 

to ^i-l. 

Conditionally to ^i-i, g^~^ is centered and Y-measurable. Moreover, g^ = g^~^ - 
(^gi-iyiXi)^ SO by associativity of variance Var(^') = Var(^'"-'^) - Var((^'"-'^)'^*^'^), and 
therefore ( [FZ| ) is equivalent to 

Wariig'-^Y^^^^) ^ var(^^-i), (GB) 

which follows directly from the assumption "{Xj : Y}^-^ ^ e". □ 



3.3 Double tensorization 

Simple tensorization as itself is already interesting since it gives an L^-type bound 
for the correlation between X and Y, which is better than the L^-type bounds typi- 
cally obtained by total variation methods. Yet it does not exhaust the full potential 
of Hilbertian correlations concerning tensorization, since obviously it does not contain 



results like independent tensorization (cf § l.l.el. 



The aim of this section is to get sharp tensorization results where we perform ten- 
sorizing on both sides, without having to assume complete independence like in The- 
orem |1.1.19[ The price to pay is that the techniques involved, though similar in their 



spirit, will be much more tricky, moreover the bounds obtained will not be completely 



optimal (see § 3.5 1. 
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3.3.a W against M' tensorization 



The following theorem may be considered as the main result of this monograph. As will 



be explained in § 3.5.b it 'contains' qualitatively all the other tensorization theorems 



(i.e. Theorems 1.1.19, 3.2.2 and 3.3.10) 



3.3.1 Theorem {'N against M' theorem). Let I and J be sets, and let {Xi)iei and (Yj)jej 
be random variables, the a-metalgebra they generate being denoted by M. Suppose for 
ciny i,j, [Xi : Yj}j{ ^ etj for some Eij ^ 0, and define the operator 



lHj) 

iaj)jej 



LHI) 

i'Lj£j£ijaj)iel, 



then: 



{Xi:Yj}^\\\e\\\Al. 
3.3.2 Remark. On (IR+)^'*'^, |||£||| is a nondecreasing function of each eij. 



(GO 



(GD) 



As the proof of Theorem 3.3.1 is rather technical, I found it useful to write down 



how it goes on a concrete example. This is performed in Appendix \3.7\ which I suggest 
the reader to look at in parallel with the proof as a complement. 

To prove Theorem |3.3.1[ we will need the following 

3.3.3 Lemma. Let Xi,X2,...,Xn and Y be random variables, call Jl their natural 
a-metalgebra, and assume that for all i e {1, . . . , A'"}, 

{Xi:Y}ji^Ei. (GE) 

Let f be an L^(X) function. For all ^ i ^ N, denote ■= a(Xi,...,Xi), resp. ^* ■ = 
aiXi,. . . ,Xi,Y), and for all 1 ^ i ^ N, define 



fi--=f^'-E[f\^i-il 



(GF) 



resp. f* ■■= f"^* -E[f\^*_^l 
and denote by Vi and V* their respective variances. Then, for all l^i ^N, 



(GG) 



(GH) 



Proof For O^i^N, define 



fi:=f-E[fm], 



(GI) 



resp. f*:=f-E[f\^*], (GJ) 
and call Vi and V* their respective variances. One has fi = T.i'>i fv, resp. f* = T.i'>i f*- 



Moreover, by the same argument as in the proof of Proposition |3.2.1[ , all the fi are 
orthogonal (that is, io 7^ ii =^ E[/"jg/"jJ = 0), thus 



(GK) 
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similarly, 



(GL) 



i'>i 



In a first step, we observe that for all i, fi- f* = if - f^O-if - f^O = f^^ - f^' = 
f^i - {f^i)^i =if - f^i)^i = {fi)^i, which by associativity of variance yields the fol- 
lowing 

3.3.4 Claim. 



Vi-V* = Ya.r{{fif^). 

Now, the following claim will be the main tool for proving the lemma: 
3.3.5 Claim. For alll^i^N, 

\2r 



(GM) 



Vi-i-V*_,^ieiVv, + JV,-V*) 



[fl 



(GN) 



Admit temporarily Claim 3.3.5 Since Vn = = 0, (GNl applied with i = N gives 
Vn-1 - y^-i ^ ^N^N, which in turn we can use in (GN) with i = N -1, and so on, to 



finally prove by finite (decreasing) induction that, for all i, 



(GO) 



Now to get (GHl, we note that Vi = Vi-i-Vi, resp. V* = V*_^-V*, so, using successively 
the inequalities ([GN]) and (|G0|), 



Vi-V* =iVi-i-V*_^)-iVi-V*) 



which is equivalent to ( GH I 



^e% + 2eis/Vi{Y^Evs/v;>), (GP) 

□ 



i'>i 



Proof of Claim [37375] Thanks to Claim [373. 4[ what we have to prove is: 

Var((/^_i)-^.-i) ^ {eiVVi + JVi-V*f. 



(GQ) 

By the definition of conditional expectation and the equality case in the Cauchy- 
Schwarz inequality, (GQ I is equivalent to saying that for all L^i^*_-^) function g, 



\Wi-ig]\^ie,VVi + JVi-V;) Sdig). 



(GR) 



So let ^ be a centered ^*_-^ -measurable real function. Since fi-i = fi + fi, E[fi-ig] = 
^[fig] + ^[fig], which two terms we shall bound separately. 

For the first term, under P['|=^i-i], fi is centered and only depends on Xj, and g 
only depends on Y. Since {Xi : Y}^^^ ^ {Xi : Y]ji ^ e^, it follows that 



mfigWi-iW^Ei ^d{fiWi-i)mgWi-i\ 



(GS) 



f*^Taking the square root of (V; - Y*) is allowed, since that quantity is nonnegative by Claim 3.3.4 
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which yields upon integrating: 



\E[fig]\^ei J Sdifi\^i-i)Sdigm-i)dP 



CS 



Var(/-j|^j_i)cJPW Yar(g\^i-i)dP 



= Si VVix Varig) - Var(^-^-i) ^ ei VVi Sd{g). (GT) 



For the second term, under P['|=^i], g only depends on Y, and E[/"i|y] = /"^ - f* as 
we noticed just before Claim 



integrating: 



3.3.4 



so E[fig\J^i] = E[(fi-fng\^i], which yields upon 



I Eifig] \ = \E[ifi- f* )g] I ^ Sdi fi - f* ) Sdig) =^Vi- V* mg), (GU) 

Co 



the last equality coming from Claim 3.3.4 Then it just remains to combine ( |GT[ ) 
and ([gu]| to get (GRI. 



□ 



Proof of Theorem 3.3.1 First, thanks to a by now classical approximation argument 



we may assume that / = {1,...,N} and J = {1,...,M}. Denote ■■= aiXj), resp. 

:= aiYj); our goal is to prove that for all f e L^(^), all g e L^C^), one has 
|E[/"^]| ^ ( III £ III A l)Sd(f)Sdig). We will use the same trick as in our alternative proof of 
Theorem |3.2.2 by the definition of conditional expectation and the Cauchy-Schwarz 
inequality, proving the inequality above is equivalent to showing that for all f € L^(^), 



Var(/-^)^(|||£fAl)Var(A 
which, by associativity of variance, is in turn equivalent to: 

Var(/--/-^)^(l-|||ef)+Var(A 



(GV) 



(GW) 



For O^i^N, resp. O^j^M, define - aiXi, . . .,Xi), resp. '^j - (j{Yi, Yj). For 
all ^ j ^M, define 

fj:=f-E[f\'Sj], (GX) 

and for all 1 ^ i ^ AT, define moreover 

/■/ f^^^^' - Elfl^j V ^i-il (GY) 
Denote V-^ ■= Yarif-'), resp. W == Var(/"/). For fixed j, the /"^ are pairwise orthogonal 



(again by the argument in the proof of Proposition 3.2.1 1 and their sum is equal to f-' , 
so: 

N 



(GZ) 



i = l 



Thus, with this notation our goal ( [GW| ) becomes: 

AT 



j:vf'^a-\\\e\\\\f:v'- 



(HA) 
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The main tool to prove (HA) will be Lemma 3.3.3 Actually the rough formula ( GH I is 
quite impratical, so we introduce a linearized version of it: for each l^i^N take some 
aj > (which for the time being is arbitrary), then by the Cauchy-Schwarz inequality, 
(iGHll implies that: 



T7* -^/-i 2xT7 ^i^i V" V" ^i'^V 
V- ^{l-E-Wi l^Ei^ai'-Eiail^ 



(HB) 



3.3.6 Remark. ( [HB) is devised so that its right-hand side is exactly the same as in ([GH]) 

ifViOc Vi. 

' I 

Let us reason conditionally to '^j-i for a few lines. Under this conditioning, 
call #i := cr(Xi, ...,Xi), resp. == cr(Xi, . . .,Xi,Yj), and / = = /^"^ Then / is an L^iX) 
function, so we are in situation of applying Lemma 3.3.3 to the functions 

fi := /-^'-EL/l^i-i] (HC) 
and /; := J. (HD) 

But in fact we already know these functions: namely, fi - fj ^ and /* = f^^. Then, 



applying the linearized version (HB I of Lemma 3.3.3 



Var(/y ^ (l - 4 - ^ E e,,a,) Var(/y-H_0 - e,,a, E ^''^^^^^"''^"^^ 

V>i 



i'>i 



OCiJ 



whence upon integrating: 



a,; 



OCii 



(HE) 



(HF) 



By Equation ( |HF| ), we have transformed our initial problem into a purely abstract 
operator problem, posed in an setting. To handle it, we need a little notation. 
Call L^{I) the set of real functions on /, endowed with the norm 

\\iVi)iel\W--=Y.\^i\. (HG) 
iel 

The dual space of L^(/) is made of the linear forms I : (vi)iei Hhvi, equipped with 
the L°° norm 

||/||oo:=SUp|/i|. (HH) 

iel 

We shall write "LHD 3 v ^ 0" to mean that all the entries of v are nonnegative, and 
"(L'^il))' 3 I ^ 0" to mean that (v ^ 0) ^ (Iv ^ 0), which is equivalent to say that all 
the li are nonnegative. Now I claim the following lemma, whose proof is postponed: 

3.3.7 Lemma. Suppose given some nonnegative numbers V'^ for (i,j) e {1,...,A'"} x 
{0, . . . ,M}, such that Equation (HF\ is satisfied for all i,j. Call 5£ the nonnegative linear 
form on L^{I) defined by 

S^v= 2^ — £ij£i'jVi, (HI) 
jej 
i,i'el 
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and assume \\S£\\oo ^ 1.. then: 



(HJ) 



i = l 



Lemma [3 . 3 . 7| has the following immediate 

3.3.8 Corollary. Suppose given some nonnegative numbers for (ij) e {1,...,N} x 
{0, . . .,M}, such that Equation (HF} is satisfied for all i,j, then: 



(HK) 



i'el 



Now we finish the proof of Theorem 3.3.1 thanks to Corollary 3.3.8 we have proved 



that (HKl stands true in our situation for any choice of positive (aj)je/. The last step 



then consists in optimizing that choice. Denote "a > 0" to mean that all the at are 
positive. One has: 

inf sup y —eijEi'j - inf{A ^0: (Ba > 0)(Vi)( Y EijEvjai' ^ Xat)} 

«>0 iel jeJ «i jeJ 
i'el i'el 

= inf {A 3^0: {3a> Q){ee* a ^ \a)}. (HL) 
But EE* is a nonnegative operator on L^il) (I mean, when seen as a matrix all its 



entries are nonnegative), so by Lemma 3.8.1 in appendix: 



inf{A^O: (3a > 0)(££*a ^ Aa)} = p(££*) = |||£|| 



This ends the proof of Theorem 3.3.1 



(HM) 

□ 



Proof of Lemma 3.3.7 We prove Lemma 3.3.7 by induction on M. The case M = is 



trivial. Suppose M ^ 1 and assume the result is true for (M - 1). We generalize the 
notation by defining, for • G {^, 1, *}, 



a,' 



jeJ' 



(HN) 



with Ji = {l}, resp. J* = {2, . . . ,M}, so that^ = ^i + ^*. Notice that \\S^*\\^^1 since 
ll^lloo ^ 1. For all i e I, define 



Vl = (1 - 4)V,o ^ Eiw - Enai ^ 



(HO) 



notation, (HFI writes 



which is the value that Vj- would take if there were equality in (HFI for j = 1. With that 

(HP) 



and by induction hypothesis we have: 

£v,^^£y>-^*((y/),e/). 



(HQ) 
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Introducing the V^, we have therefore the following chain of inequalities: 



N 



N 



i = l 



N 



E y> + \\{Vl - Vl)i,j II 1 - ^*{{Vl)iel) - ^*{iVl - Vl)iel) 



i=l 



N 



N 



N N 

LV'-^'((V^^^eI)-^*(iV^)^eI)=J:V^-^((Vi'>^eI), (HR) 



so (H J ) is true for M, whence the lemma by induction. 



□ 



3.3.9 Remark. Our proof of Theorem 3.3.1 handled the Xi and the Yj in a fully non- 
symmetric way, since we began with putting orders on I and J, which orders played a 
crucial role in the decomposition of/". Yet the bound ( [GD| ) obtained is obviously symmet- 
ric by re-labelling the basic variables — and this is not due to having proceeded to any 
're-sjrmmetrization' step. . . To date I have no simple explanation for this 'coincidence'. 



3.3.b *Z against Z' tensorization 



The proof of the W against M' theorem was quite more technical than that of the 
W against 1' theorem; because of that, in order to get tractable computations we had 
to use suboptimal inequalities at two places: 

• Claim |3.3.5| is suboptimal: it has indeed the same shortcoming as Proposi- 



tion 3.2.1 exhibited compared to Theorem 3.2.21 namely, it does not 'recycle the 



losses' occurring when one makes g covariate with both fi and fi (cf the discus- 
sion on page |48| just after the proof of Proposition |3. 2. 1| ). 
• Our linearization technique is suboptimal in general, even after optimizing 
the at. In fact, as we said before. Inequality ( |HB| ) is optimal if and only if one 
has Vi (X at; thus, for (ipj) to be always optimal, one has to have cx ai for 
all j, with the same values for the at. This would imply that all the sequences 
(^/)o«i<n are proportional, which is not true in general. 

So, Theorem 3.3.1 is certainly not optimaf^ — this is confirmed by the example 
of § 3.7[ Nonetheless, there is one particular case in which an alternative reasoning 
5delds an optimal boun^^ This case is when some symmetries in the decorrelation hy- 
potheses allow us to transform the original two-parameter problem (indexed by 7 x J) 
into a one-parameter problem (indexed by Z). Let us state and prove the corresponding 
result: 

3.3.10 Theorem ('Z against Z' theorem). Let I and J he sets isomorphic to Z, and 
let {Xi)iei and {Yj)jej be random variables such that, M denoting the a-metalgebra 
they generate, one has for all i, j g Z 



{Xi:Yj}j,^eij-i) 



(HS) 



f^^Though, as we will see in § 3.5.b it is 'asymptotically optimal'. 



'^'"^The bound's being optimal shall be proved by Theorem 



3.5.3 
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for some function e: Z ^ [0, 1]. 
Then 

{Xi:Yj}^e, (HT) 

where £ g [0, 1] is characterized by: 

Arcsin^ = ( ArcsineC^)) a — . (HU) 

zeZ 2 



3.3.11 Remark. If we apply Theorem 3.3.1 to the situation above, we find {Xj :Yj} 



^ (Lzez^^(2)) ^ 1 (cf. § 3.6.bl. The latter expression is always ^ e because of the concav- 



ity of the function sin(' a |) on U+, and even > e if e 7^ 0, 1; so, when it is applicable. 



Theorem 3.3.10 is strictly stronger than Theorem 3.3.1 



Proof. Let f and g be resp. X/- and Yj-measurable functions. Denote ^ ■■= a{X), 
resp. ■■- a(Y), and for i e Z, resp. j g Z, denote ■■= Vi'si c(-X^i'), resp. '^j ■■ = 
\/j>^ja(Yf). For (i,j) G Z x Z, define 

fj f'^J^^' - E[f\%- V J^i-i] (HV) 

ml 



an 



g) := g'^J - Eig'^J V ^il (HW) 



Denote V : = VarC/), W ■■= Yar(g), ■■= Yarif-!), W' := Var(^^.); also denote 

^ ^ J J 

S,,:=E[/-/-V}"']. (HX) 

Our auxiliary functions were devised so that 
3.3.12 Claim. Provided the sum in the right-hand side is absolutely convergent, 

E[fg] = Y^Sij. (HY) 
ij 



Proof of Claim [373. 12\ First define f ■= f , so that f is "i^-measurable and E[fg] = 
Eifgl For je Z, define gj g'^J - E[g\'^j-il resp. fj ■■= - E[f\'^j-i]-^ we have g = 
T.jgj and f = T.jfj, which are the respective decompositions of g and f on the same 
basis of orthogonal subspaces of L^C";^), so E[fg] = T.j^ifjg j]- The terms of the right- 
hand side of that formula are unchanged upon replacing fj by fJ-^ ■■= f -E[f\^j-i], 
since E[(/"-^~^ - fj)gj] is zero — the function ifJ'^ - fj) is indeed equal to (/ - Wf\'Sj^), 
which is centered conditionally to '^j, while gj is "S^^ -measurable. In the end we have: 

EVfg} = Y^EVf^-^gji (HZ) 

j 

So in a first step we have decomposed E\_fg] into a sum indexed by j. Now we 
decompose each term of that sum into a sum indexed by i. Let us reason conditionally 
to "Sj-i. Then f^''^ is an L'^i^) function and gj is in L'^iYj). We compute E[/"^- Vjl 



'^"^Beware: the definition of g''- is not analogous to the definition of f'! ! 
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as in the first step of this proof: first we replace gj by gj ■■ = igj)^', then we decompose 
f'-^ = Lift^ Sind gj = Zigji, with := ifj-^)^^-E[fJ-^m.i^ resp. gji ■■= gf - 
E[g j\^i-i], and by orthogonal decomposition we get Eif-^'^gj] = ^ifi ^Sji^', then we 
conclude by saying that Elf'! ^gji] is actually equal to Elf'! ^gy^], since ig^'^ is 
centered conditionally to while ^ is -measurable. In the end we have obtained 

E[f'-^gj] = Y^E[ff^g\r\ (lA) 



which combined with (HZ I yields (HYI. □ 



So we have expressed E[fg] as a function of the Sij. It is also possible to 'read' the 
values of y and W from the V- , resp. from the W\ via the formulas: 

V = lim (E^/); (IB) 
Now we are looking for relations between the Vr', the W and the Sij. The first 



relation comes from the decorrelation hypothesis: conditionally to 'Sj-i v ^ is 



in L'^iXi), resp. ^ is in L^CY,), and {Xi : Y,} ^ £(j - i), so: 



The second relation means that a large value of \Sij\ forces Wj to diminish. To state 
it, we observe that, since ff^ is (^^y-i v J^j)-measurable, Sij = Eif^'^igy^fj-^"^'], 
so by the Cauchy-Schwarz inequality \Sij\ ^ Sd(/"/"bSd((^^-^)^^-i^-^')- Moreover, 
since g^^ -ig'j~^)'^J-^''^' = g^ one has by orthogonality Var((^^."^)*i-i^-^') = Var(^l"^)- 
Yarigj), so our inequality becomes 



\Sij\^\/vr \/wr'-w] (IE) 



J J 



(where it is understood that Wj ^ Wj ), or more eloquently 



Wj^Wj-^-(S^//v/"^ (IF) 



provided V. ^ > 0. 



The third and last relation means, on the other hand, that a large value 
of \ T.i'>iSi'j\ forces T.i'>iVi to diminish. To state it, we denote 



fJ:=f.f'^.^^. = Y.fJ^^ (IG) 

i'>i 



'^*^Notation is consistent: this ^ is indeed the same as the ^ defined by (HVi, since we are reason- 
ing conditionally to ^j-i- 
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whose variance is Yar(f'l) = Li'>i Var(/"^:',) since the /"^^ are pairwise orthogonal. One has 

LSi'j = L^^ff'g'j-'^ = I E[/^^- V}] = nff'g)] = mft^f^'-^^g)], (ih) 

i'>i i'>i i'>i 

SO by the Cauchy-Schwarz inequality, 

I E s,j\ ^ sd((/y-b^^-^osd(4). (II) 

i'>i 

Since f-~^ - {fJ~^)'^j^^i = fj^ one has by orthogonality 

Var((/y-b^^^^' ) = VarC^-^ - YMfj), (I J) 

so our inequality becomes 




(IK) 



or more eloquently: 



Ev^^^Ev^" -(Es,,)Vw;. (id 



So, we have transformed our initial probabilistic problem into the following ana- 
l5^ic one: let A be an array indexed by Z x Z, each entry (?, /") o f which contains three 



numbers ^ 0, Wj ^ and Stj, satisfying (ID I, (IE I and (IK) — we will say such an 
array is correct. We define y by (IB I and W by~iIC[f7and we set S - T.ijSij (provided 



it makes sense); our goal is to get a bound of the form "\S\ ^ eVVW", with e only 
depending on £(•). 

Note that A priori some problems of summability can arise from A's being infinite. 



for instance to check ( |IL[ ) or to define S. However, in the situations which are of interest 
to us, we can restrict to cases in which A is of nice particular form. To do this, we 
first approximate f in L^iX), resp. g in L^(Y), by a function depending only on a finite 
number of X^, resp. of — say, we assume f is X^-measurable and g is -measurable 
for finite I <^I,j c J. Then, we define a new model (Xi)iez,iYj)jez by Xj = Xi for i el, 
resp. Yj = Yj for j e J, and Xi,Yj = d for i t I,j tJ,d being some cemetery point. This 
new model still gives a correct array, for which SI\/VW is arbitrarily close to the initial 
value of E[/"^]/Sd(/")Sd(^); and the new array is of the following form, which we will 
call compact, for which all the quantities of interest are well defined: 

• Vr' is zero as soon as i ^ /, and it does not depend on j for j < min j, nor for 
j ^ max j ; 

• Similarly, Wj is zero as soon as j € J, and it does not depend on i for i < min/, 
nor for i ^ max I; 

• Sij is zero as soon as (i,j) €l x J. (This condition automatically follows from the 
first two if the array is correct). 

We define the following operations on arrays: 
3.3.13 Definition. 

• For 2 G Z, we define the translation operator on arrays such that, if the entries 
of A at ii,j) are V/,Wj,Sij, the entries of t^A at ii,j) are v/^;, 
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• For A and A two arrays with entries V"/, resp. Vr,etc., for a,^ two real 
numbers, we define the linear combination aA + j6A as the array with entries 
ay/ + aW + pW,etc.. 
3.3.14 Lemma. Correct arrays are stable by translations and by nonnegative linear 
combinations, i.e., if A and B are correct arrays, then for all z eT and a,l5 ^ 0, t^A 
and aA + j6B are correct too. 



Proof of Lemma \3.3.14\ Recall that being correct means satisfying ( |ID| ), ( |IE[ ) and ( |IK[ ). 
These conditions are trivially stable by multiplication by a nonnegative constant and 
by translation It remains to see that they are stable by addition. The technique 



being the same for all three inequalities, we just treat the case of (IE I. Stability of this 



condition by addition is a consequence of the following inequality (which is in fact a 
particuliar case of the Brunn-Minkowski inequality, see 1.17,1 ): 

3.3.15 Lemma. For all 01,61,02,62 ^ 0, 



\/(ai + a2)(6i + 62) ^ Voibi + \fa^2- 



(IM) 



Proof of Lemma 3.3.15 Take squares on both sides of (TM]) and notice that (01 + 02) 
(6i + 62)-(\/ai6i + \/a262)^ = ai62 + a26i-2v/ai6ia262 = (\/ai62- s/a^)^ ^0. 

n 



\ 61 = W.-'^ - W-M = W'-'^ - W, we get: 



For A and A two correct arrays satisfying (IE I, applying (IMl with ai-V^ ,02 



\Sij + s,j\^ \Sij\ + ^ V^" V^"' - ^ + V^" V^"' - ^ 



so (IE) is still vaUd for (A + A). 



^ \/ v/"^ + v/" Y(^j"^ + - + wj), (IN) 



□ 



Now, thanks to Lemma [3.3.14| we will reduce our problem on (Z x Z)-arrays into a 
problem on Z-arrays. Suppose A is a correct array with certain values of y, W and S. 
Then, for k^Q, the array 

1 * 
A* = E T^A 



2k + 



(10) 



z=-k 



is correct too, with the same values of y, W and S as A. Now when k ^00, Ak 'looks 
more and more like a Toeplitz array', that is, an array whose entries at ii,j) only 
depend on (j- i). To state it rigorously, we need some definitions: 

3.3.16 Definition. 

• Here, a Toeplitz array will mean a Z x Z array whose entries at (i,j) only depend 
on ij- i). For such an array, for 2 G Z we denote by V{z),W(z),S{z) the quantities 
characterized by W = V(j-i), etc.. 



'^^^ Getting stability of Condition \id\ by translation is actually the only place where the sjmimetries of 
the problem are used. 
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• Actually we can always assume our Toeplitz array is Toeplitz compact, which 



means that there exists some z ^z'^ such that: 



- V(z) does not depend on z for z <z~, nor for 2 ^ 0" 

- W(z) does not depend on ^ for ^ ^ 2~, nor for z > z'^; 

- S(z) is zero as soon as 2 < 2~ or 2 > z"*". 

• For a compact Toeplitz array, we define v,w,s as 'renormalized versions' 
oiV,W,S: 



V 

w 
s 



V{z<z-)', 



(IP) 
(IQ) 
(IR) 



• A Toeplitz array is said to be correct if it is correct when seen as an ordinary 
array For a Toeplitz array, Equations (ID I, (IF I and (ILl become respectiveljp^ 



£iz)\JV(z-i)W(z+iy, 



z'<z 



(IS) 
(IT) 
(lU) 



With that vocabulary, our informal statement can be made precise: let A be a com- 
pact correct array with entries ,W\Sij, and associated quantities V ,W ,S, and de- 



fine the arrays A^ by (10 1. Then when k ^00 one has {2k + l)Ay^ A (in the sense that 
each entry of {2k + 1)A^ converges to the corresponding entry of A), where A is the 
Toeplitz array with entries Vj^,W\Sij defined by: 



J-l=Z 



J-l=Z 

j-i=z 



(IV) 
(IW) 
(IX) 



This array A is Toeplitz compact with z~ 
the quantities ( IP l-( IR I for A are: 



V 

id 
s 



min j-max/, resp. 2^ = max j-min/, and 



= V; 
= W; 
= S. 



(lY) 
(IZ) 
(JA) 



Moreover A is correct, because all the {2k + l)Ak are, and being correct is clearly con- 
served by array convergence. 

The consequence of this statement is the following claim, which achieves the re- 
duction to a 'Z-indexed' problem I alluded to a few lines above: 



^*^Note that the way (|lU| follows from ( IL 1 is rather tricky, because it appears a difference between 



two infinite quantities, which has to be 'renormalized' in the convenient way. 
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3.3.17 Claim. The supremum of \S\I\/VW for correct arrays is not greater than the 
supremum of \s\l \/vw for correct Toeplitz arrays. 

So we have to study (compact) correct Toeplitz arrays. Consider such an array 



Denote 6{z) := Arcsine(2); then ( IS I can be rewritten 



3e{z)E[±B{z)] S(^) = sin0(2)-^y(,_i)W(^+i). (JB) 
Now, notice that for fixed values of the V(z), the S(z) and w, if we have values W(z) such 



that (ISl-(IU) are satisfied, we can modify those W(z) so that (IT) becomes an equality 
for all z, an operation which keeps ([IS]) and dTub true since it can only make the W(z) 



increase. So we can suppose that ( IT I actually is an equality, i.e. that for all 2 g Z, 



W(z) = w n cos^kz'). 



Then it remains to integrate (lUI. For 2 G Z, denote 



r(2):= ^ (sin0(2')- n COmz")-JV(z'-l)), 



(JC) 



(JD) 



so that (lU) becomes 



V-(,_i)^i;-r(2)^ 
r(') satisfies the recursion equation 



Hz + 1) = sine{z\ V(z-i) + cos0(2)r(2). 



so by ( JEl: 



\T{z + 1)1 ^ sin \e{z)Wv - r(2)2 + cose{z)\T{z)\. 



From ( JG I, we will now prove that for all 2 G Z: 



ir(2)i^sin(^A ^ e{z'))V^. 



z'<z 



(JE) 



(JF) 



(JG) 



(JH) 



Indeed, (JHI is equivalent to saying that there exists some 77(2) g [O,Xz'<2 0(2')] such 



that |r(2)| = sinri{z)y/v, which we prove by induction. First, since our Toeplitz array 
was supposed compact, V2 < z~ 6{z) = 0, so the formula is true for 2^2" with 77(2) = 0. 



Next, if the formula is true for 2, then ( |JG[ ) yields 

|r(2)| ^ (sin|0(2)| cos 77(2) + cos \diz)\ sin 77(2)) = sin(77(2) + \kz)\)Vv, ( JI) 

where 77(2)+ |0(2)| ^ Lz'<zOiz') + Oiz) = Zz '<z+id(^'X so the formula is true for (2 + 1), 
which ends the induction. 



To conclude, we write that s = T.zS{z) = r(2 >z'^)\/w. But by (JHI, \Tiz > z'^)\ ^ 
sine - s/v, so in the end: 

|s| ^ sine • i/iw, (JJ) 



quod erat demonstrandum. 



□ 
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3.3.18 Corollary ('Z" against Z"' theorem). Let n ^ 1; let iXx)x£Z"' and iYy)yez"' be 
random variables, and assume there exists a function e : Z" ^ [0, 1] such that for 
all x,ye Z", 

{X^:Yy}j^^e{y-x), (JK) 

^ being the natural a-metalgebra of the system. Then {X :Y} ^e, where e the number 
in [0, 1] such that 

Arcsin(£) = ( ^ Arcsine(f;)) a — . ( JL) 

Proof. To alleviate notation, we define the 'arcsin-sum' as the binary operation + : 
[0,1]^ ^[0,1] defined by: 

a+6 = sin((Arcsina + Arcsin6) A — ). (JM) 
+ is associative, commutative and nondecreasing, so it can be extended into an oo-ary 



operator X; with this notation, ( |JL[ ) merely writes e = T.v£Z"-£iv)- 

Let (ei,...,e„) be a Z-basis of Z". For 1 ^ r ^ /i, we identify Z'' with Zei ® Ze2® • •• ® 
Ze^; we also denote Z-jr ■■= Ze^+i ® • • • ® Ze^- What we will prove is actually the following 

3.3.19 Claim. For all re{l,...,n}, all x,ye Zjr, 

{Xx+z'- : Yy+z'-}ji ^ Y.vez'-^^y -x + v). ( JN) 



The statement of the lemma then corresponds to the claim for r -n. 



We prove Claim |3.3.19| by induction on r. The case r = 1 is merely Theorem |3.3.1C 
Now let us show how to go from the case r - 1 to the case r for r > 1: 



§] 



Take x,yeZ-^. We notice that 

Xx+Z'' = iXx+ier+Z''-^)ieZj 



(JO) 



which we shorthand into Xx+z'' = (Xi)iez; similarly we write, with obvious notation, 
Yy+z'- =■ (^j)jez- By induction hypothesis one has for all i,j e Z: 



{Xj :Yj}ji ^ Hvez'-^'^^y - ^ + 0' - i)Gr + v). 



(JP) 



Since the right-hand side of ([JPJI only depends on (j -i), we can apply Theorem |3.3. 10 
to the Xi and the Y^, which yields 

{Xx+Z'- ■■ Yy^jr\j^ ^ Ezez(Et;eZ'-i^(3' " X + 26^ + v)) = Y.veZ'-'^^y ' X + v), ( JQ) 

i.e. (fjlb. □ 



^^More precisely, it is the subjective version of that theorem, cf. § 



3.4.b 
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3.4 Generalizations of the tensorization results 



3.4.a Minimal Hypotheses 



When reading the proofs of the tensorization theorems, you may have noticed that 
taking the decorrelation hypotheses w.r.t. the whole cr -metalgebra of the system was 
a needlessly strong assumption. Actually each decorrelation hypothesis can be stated 
relatively to only one cr-algebra, in the following way: 



• For Theorem |3.2.2[ one needs only assume that for all i € I, Xi and Y are ei 
decorrelated when seen from o"((Xj')j'<j); 



• For Theorems 3.3.1 and 3.3.10 one needs only assume that Xi and Yj are eij- 
decorrelated (or e(j - i)-decorrelated) when seen from a{(Xit)it<i,(Yf )/<])■ 



In practice it is rare that one can bound above {Xi : Y} 



or{Xj:y,}/- - 

{i'<i} ^^{i'<i}'^u'<J} 



) 



more sharply than {X :Yi}j^, resp. {Xi :Yj}^; yet it is worth remembering that the 
'genuine' decorrelation hypotheses are weaker than those we wrote, especially when 



one gets interested in optimality issues (cf § 3.5) 



3.4.1 Remark. In our tensorization proofs we took / and J finite; yet those proofs, 
and therefore everything in this subsection, remain valid if we take for I or J any 
(countable) well-ordered set, in particular if / or J is 



3.4.b Subjective versions of the theorems 

In the tensorization theorems I stated, the decorrelation hypotheses were given with 
regard to the natural cr -metalgebra ^ of the system, while the results were given 
in terms of 'objective' (I mean, not subjective) decorrelations. Yet actually it can be 
shown that our results are still valid w.r.t. ^ — or even w.r.t. any sharper cr -metalgebra 
jy 3 provided decorrelation hypotheses are stated w.rt. J/. As an example, let us 



state and prove the subjective result corresponding to Theorem 3.2.2 



3.4.2 Corollary. Let X, (Yi)iei and iZg)geQ be random variables, and call JV the a- 
metalgebra they span. Suppose we have bounds {X : Fj}^ ^ ej for all i g /; then: 



{X:Yi}^^Jl-Y\il-Ep. (JR) 

iel 

Proof. Up to making up copies of / and 0, we can assume that {0}, / and are disjoint, 
which allows us to denote Zq-=X and Zj --Yi for iel, so that JY is the cr -metalgebra 



spanned by the Zq for g := {0} i+J / i+J 0. Then ( |JR| ) means that for all He©, for 
(almost-)all 2h, one must have: 



{X:Yi}^ Jl-Wil-el) under the law P[-|Zh = 2h]- (JS) 

iel ' 



So, Corollary 3.4.2 will ensue from Theorem 3.2.2 provided we can prove that, denoting 



by M the cr-metalgebra spanned by X and the Yi, one has for all iel: 

{X:Yi}ji^Ei under the law P[-|Zh = ?h]. (JT) 
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But under a law P , saying that {X :Yi\j^ ^ £j means that for all Y c {0} i+J 7, for 
(almost-)all 2^, one has {X : Y^} ^ Ei under the law PblZy = z'^l So, for P = P[-|Zh = 
z^], ( JTl means that, for all z'-^: 



{X : Yi) ^ Si under the law P[-|Ze = ze and Zy = z'-^l (JU) 



In Formula (pu|l we can assume that zg and 2^ coincide for all g Sn Y, since otherwise 

Y 



the event "Z^ = 2^ and Zy = Zy" would be empty and there would be nothing to say 



Then "Z^ = ze and Zy = Zy is of the form "Zhuy = ^Huy", where E u Y c 0, so that 



( JU) follows directly from the hypothesis {X : Yi]j{ ^ tj. □ 



3.5 Optimality 



3.5.a Exact Optimality 



With the minimal hypotheses stated in § 3.4.a Theorems 3.2.2 and 3.3.10 are optimal: 



3.5.1 Theorem. The bound in Theorem \3.2.2\ is optimal, in the following sense: for 
any integer N, for all (ei)i^i^Ar in [0,1]^, one can find random variables Xi,...,Xn,Y 
such that for all ie{l,.. .,N}, 

(JV) 



{Xi-.Y}- =Ei 



and 



{X:Y}=ll-W{l-e1). 



(JW) 



3.5.2 Theorem. The bound in Theorem 3.3.10 is optimal, in the following sense: 
for any integer N, for all {£iz))-Ns:zs:N ^ [0, one can find random vari- 

ables {Xi)iez and (Y^Oyez such that for all i,j e Z, 



(X ^ e(j-i) if\j-i\^N; 

■ ^Pix^,,,^^,Y^J,,J^) \ Q if\j-i\>N 



(JX) 



and {X :Y} = e, with e defined by: 



N 



Arcsine= ^ ArcsineC^); 



n 



(JY) 



z=-N 



Actually, as proving Theorem |3.5.2| for all the (e(2))-Ar^z^iv involves some heavy 
technicalities fSB], I will only prove the slightly weaker following 

3.5.3 Theorem. For any integer N, the exists a neighbourhood UofOin [0, i]f~^' - '^' 
such that, for all (e(z))-NsizsiN ^ U, one can find random variables (Xi)iez and (Yj)jez 
satisfying and (OT^ 



3.5.4 Remark. On the other hand. Theorem 3.3.1 is obviously not optimal since, as we 
pointed out, its bound is strictly weaker than that of Theorem |3 . 3 . 10} 



'^''l^Notice that in the neighbourhood of 0, one can drop the "a |" in the right-hand side of ( JY 
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The proof of Theorem 



3.5.1 



relies on the following important result: 



3.5.5 Lemma. Let iXi,...,X]si,Y) be an {N + 1)- dimensional Gaussian vector. For 
alll^i^N, define 

ei:={Xi:Y}- , (JZ) 

^{i'<i} 

then one has exactly: 



{X:y}= /l-Od-ef). (KA) 



3.5.6 Remark. Maximal correlation, as I told in § [Tj is fundamentally a Hilbertian 
concept. When one deals with Gaussian vectors, the Hilbert spaces involved actually 



have finite dimensions, so that Lemma 3.5.5 about decorrelations can also be seen as a 



result about Euclidian spaces. In Appendix 3.9, 1 will present an unexpected corollary 



of this lemma, stating a geometric property of the 3-dimensional Euclidian space. 



Proof of Lemma |3. 5. 5| To alleviate notation, we denote ^i-i ■= cr(X{j'<i}). Since {X,Y) 
is Gaussian, the law of(Xi,Y) under P[-|xi,. . . ,Xj-i] is Gaussian and only depends on 
(xi,...,Xj-i) through an additive constant; consequently, we can speak of "the Hilber- 
tian correlation between Xi and Y conditionally to ^i-i", which is ei, and also ofthe 
conditional variance of Xj w.r.t. ^i-i", resp. "the conditional variance of Y", resp. "the 
conditional covariance of (Xj,F)", which we denote resp. Var(Xj|=^j_i), Yar(Y\^i-i), 



CoyiXi,Y\^i-i). By Theorem [LOj one has: 

Cov(Xi,y|^i_i) = ±eiSd(Xi|^i_i)Sd(F|^^-i). (KB) 

Now take giY) = Y and fiX) = Z^^^PiXi, for some j6; £ to be chosen later. Then 



is equal to Y-E[y |^j_i] and fi is proportional to Xj-E[Xj thus, by (KB I and 



our model's being Gaussian, all the inequalities until ( |FU[ ) in the proof of Theorem 3.2.2 



actually are equalities for Ei = ei. If moreover Cov(/"j,^' ^ for all i, then we 



can drop the absolute values in their left-hand sides, and thus ( [FV| ) will also be an 
equality. Then, to get an equality in ( |FW[ ), it just remains to ensure that the final 
Cauchy-Schwarz equality is an equality, i.e. to ensure that one has, for all i: 

i-l 

Var(/-i)oc e2f](l-e?,). (KC) 



If all of that is satisfied, then one will have exactly E[/"^] = Jl-nid-e )Sd(/")Sd(^), 



so that {X -.Y} ^ \ 1- Wiil - e ). The converse inequality being obviously true by (the 



minimal version of) Theorem 3.2.2 the result will follow. 



So, we have to check that the choice of the Pi can be performed so that (|KC]l is sat- 
isfied, with CoYifi, g''~^\^i-i) of the good sign. To do this, we will choose successively 
relevant values for j6iv, Pn-i, ■ ■ ■ , 

We observe that, if PN,---,Pi+i have already been fixed, then Pi 
Covifi,g''~^\^i-i) is an affine function with slope 
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Moreover, Var(/"i) = Var(/"i|^i_i) as fi is centered w.r.t. ^i-i; so, since ft oc Xj 



E[Xi|^i_i], (KB I implies: 



VarC/i) 



efVar(y|Jfi_i) 



(KE) 



So, provided all the three quantities ej, Sd(F|=^j_i) and Sd(Xj|^j_i) are nonzero, there 



exists a (unique) satisfying (KCl 



Now if Sd(Fl^j-i) is zero, this means that Y is =^i-i-measurable; then one of the e^' 
has to be 1 and thus the result is trivial. Next if Sd(Xj|^i_i) is zero, this means that 
Xi is ^i-i -measurable; then 6^=0 and fi = 0, so that (|KC]| is automatically satisfied. 
Finally if ej = and Var(Xj|^j_i) > 0, then there exists a (unique) Pi such that fi = 0, 
for which dKCl) is satisfied. So all those particular cases actually work fine too. □ 



Proof of Theorem 3.5. 1 For technical reasons, we begin with noticing that the theorem 
is immediate if some ei is equal to 1, so that we can assume that all the ei are < 1. 
Thanks to Lemma |3.5.5[ it suffices to prove that for any sequence of Ei g [0, 1) it is 
possible to build a Gaussian vector (X,Y) for which ej = £i Vi. To do this, let • • ■ ,CiV 

\/l - aid + some parameters 

£] this is always possible, by the 



be i.i.d. ^(1) variables, and take Y = ^ and Xi 
Ui € [0, 1). We want to choose the such that e(a) 
following method: 



First we compute ai: By Theorem |1. 2. 6[ one can write down the equation link- 
ing ai and ei. It is clear without knowing the precise form of that equation 
(actually, ei = ^/al) that ei is a continuous increasing function of ai with ei = 
for ai-0 and ei = lforai = l. Therefore there is a unique ai such that ei = ei. 
Then we compute a2: As we already know the value of ai, we can treat it as 
a constant and look for the equation linking a2 and ea, which we compute by 



Theorem 1 1 . 2 . 6 1 again . That equation, though more complicated than in the previ- 
ous case (actually, 62 = y^vTr^^/vT^-oiOa), exhibits the same behaviour: 62 
is a continuous increasing function of a2 with e2(a2 = 0) = and 62(0:2 = 1) = 1. 
Therefore there is a unique a2 such that 62 = £2- 
We carry on this process until having determined all the ai. 

□ 



Proof of Theorem 3.5.3 Again, the principle of the proof will consist in showing how 
the optimal bound can be attained for relevant Gaussian vectors and linear functions 
of them. 

We consider independent JVil) variables (^j)jez and (cJijXijyeZxZ- For all i we set: 

N 

Xi= ma+z), (KF) 

z=-N 

resp. for all j: 

N 

Yj = ^j+ E (^z^ij-z)j (KG) 

z=-N 

for some real parameters iaz)-N^z^N to be fixed later. This model is obviously invari- 
ant by translation of the indexes. For 2 g Z, define 

CoY{Xi,Yi+,\^i.iv'^i+,.i) 



SdiXi\^i-i V ^i+,-i) Sd(Fj+, l^j-i V ' 



(KH) 
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where the choice of i does not matter. Since our model is Gaussian, by Theorem 



1.2.6 



{Xi:Yi+,}(- ^ ^ = |e^|. (KI) 

By the properties of Gaussian vectors, it is possible to write down explicitly the 
equations linking the to the Uz- Though these equations may be quite horrendous, 
some of their properties can be easily established: 

3.5.7 Claim. 

(i) For \z\ > N, 62 = (for any choice of the Uz); 

(ii) The map (a-jv,---,aiv) '-^ (e-jv,---,eAr) is of class '^'^ on the neighbourhood 
o/"(0, ...,0), with: 

' dcz \ ~ ly = 

-^(0)=-^^. (KJ) 
\dayj ^/2N+1 

By the inverse function theorem, one can therefore find neighbourhoods V and C7 of 
in [fjf-^' - '^J such that the map a e is a 'i^^-diffeomorphism from V onto U. In 
particular, for e in such an U we can always fix the az of our model such that 



li^i ^]S[£{z), so that ( JXl is satisfied. 



Now we have to choose f and g. Morallj^ we have to take the functions f and g 
having maximal Pearson correlation. Since the model is Gaussian, these functions will 
be linear, and since the model is invariant by translation, they will likely be invariant 
by translation too. So we would like to take, formally, f{X) = T-iez^i and giY) = 
T.jezYj- As such functions are not properly defined, we will rather consider f[k]iX) = 
T.^^_kXi, resp. g[k]iY) = Yj, and then we will let k tend to infinity. 

For these f[k] and g[k], define the VikY-, the Wikf- and the S[k]ij as in the proof 
of Theorem 3.3. 10[ which are gathered into the array A[^]. The following properties of 



the A[^] follow easily from the structure of our model: 
3.5.8 Claim. 

(i) All the V[ky. ,W[kY., S[k]i j are bounded uniformly in i,j,k. 

(ii) • V\k}^^ is zero as soon as i t{-k -2N ,. . .,k}; 

• WVkVj is zero as soon as j t{-k,...,k}. 
(Hi) S\_k]ij is zero as soon as \j -i\> N. 

(iv) • For -k ^ i ^ k - 2N, VikY. only depends on (j - i), even when k varies. We 

denote its value by V(j-i). 

• For -k ^ k, W[k]j only depends on (J - i), even when k varies. We denote 
its value by W(j-i). 

• For -k ^ i ^ k - 2N and -k ^ k, S[k]ij only depends on (j - i), even when 
k varies. We denote its value by S{j-i). 

(v) • V{z) has some constant value V for z < -N; 

• W(z) has some constant value w for z > N. 



By Claim 3.5.8, A[k] converges pointwise to some compact Toeplitz array A, whose 



entries are the V{z),W(z),S{z) introduced at Item (ivi of the claim, whose values v and w 



'^"^I say "morally" because nothing ensures that the supremum (AGi would actually be a maximum 
here. 
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are those introduced at Item (jv|l, and whose value s is Z^_jv'S'(z)- AH the arrays A[k] 
are obviously correct since they correspond to true functions, so by passing to the limit 
A is correct too. 



Since our model is Gaussian, all the inequalities ( |ID[ ), (IE I and (IK I are actually 



equalities for the arrays A[k]; moreover, since the are nonnegative, the S[k]ij are 



nonnegative. By letting k tend to infinity, it follows that all the inequalities ([TS])-((TU]) 
are actually equalities for the array A, with the S(z) nonnegative. Consequently in ( JB I 
one has d(z) - 9(z), and all the further inequalities are actually equalities, so that in 



the end ( J J I becomes 



'VW 



(KK) 



Now, defining V[k], W[k] and S[k] by resp. (IBI, (ICI and (HYI for the arrays A[k], 



Claim |375.8| shows that, when k ^ oo, V[k] ~ 2kv, resp. W[k] ~ 2kw, resp. S[k] ~ 2ks, 
so (|kkJ implies that S[k]/VV[k]W[k] e. But recall that VVk],Wm and SVk] are the 
respective variances and covariance of the functions f[k] eL^iX) and g[k] eL^iY), so 



by the very definition ( AG I of Hilbertian correlations, 

- - S[k] 



(KL) 



VV[k]W[k] 

Making ^ ^ oo, it follows that {X :Y}^e; the converse inequality being obviously true 



by (the minimal version of) Theorem 3.3.10 this proves Theorem 3.5.3 



□ 



3.5.9 Example. In this example we will carry out explicit computations for a Gaussian 
model close to the model presented in the proof above. We take independent J\^(l) 
variables Co, Ci,---, ••■,C-i/2,Ci/2,C3/2---, ...,w-i/4,a>i/4,W3/4,..., and we set 



Xi = (i + x/a(wj_i/4 + coi+y^), 



(KM) 
(KN) 



resp. I j 

for all integer i, resp. all half-integer j, where a is some arbitrary nonnegative pa 



rameter. We are going to show that for this system (HUl is actually an equality, in 



accordance with the proof of Theorem |3. 5 . 3 
For half-integer z denote 



iXi : Yi+2:}(x , Y , 



)' 



(KO) 



Bz for 



where the choice of i does not matter by translation invariance. Clearly e _ 
all z and e z = for \z\ > 1/2, so to know all the the only nontrivial computation is 
computing ei/2. Let us perform it. 



Since everything is Gaussian, by Theorem |1.2.6[ ei/2 is the value, under the law 
P[-|X{i<o},?o<i/2} = 0]>of 



|E[XoFi/2]|/Sd(Xo)Sd(yi/2). 



(KP) 



Under the law P['|-X'{j<o},F(;<i/2) = 0], it is clear that Co,^^i/4;Ci/2,<^3/4, ■ • • have exactly 
the same (joint) law as under P, and that w_i/4 is still independent of these (joint) 
variables, though its variance shall have diminished. So we need only compute 



V := Var(w-i/4|X{j<o},Fo<i/2) = 0)- 



(KQ) 
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Denote Lr == (. . . ,X-2, ^-3/2,^-1,^^-1/2), resp. Li := (...,X-2,F-3/2,X-i). We write 
that (formally) 

dP[Lr = and co-y^ = x] oc e~''^''^"dx, (KR) 

and also dP[Li = and co-3/4 = y]oc e ^ dy by translation invariance. But under P[- 1 
Li = and a>_3/4 = y], the law of (^-1/2,^1/4) is the same as under P, so one has: 

e"""'^^^ oc dP[Lr = and oj-y^ = x] 

= / dydP[L\ = and co-3/4 = y]<iP[i^-i/2 = and a»_i/4 = x\L\ = and lo-m = :v] 

oc I (iP[y_i/2 - and (y-1/4 = x|Li = and (jO-3/4 = y]e~-^ '"^^ dy 
Jy 

= I (iP[(^_i/2 = ~n/«(''c + 3') and a>_i/4 = x|Li = and a»_g/4 = 3']e~^ ''^^dy 

Jy 



[ e-"^'^y^"%-^"%-y"^^"dy<xexp{(l + a-^^)-}, (KS) 
Jy a + l/i; 2 



oc 



so that V must satisfy: 



1 + a 



a 



a + l/v v' 



whose only nonnegative solution is 



2a 



(KT) 



(KU) 



So one has_Sd(Xo|Li = 0) = Vl + av + a = (^1 + 4^ + l)/2, Sd(ri/2|Li = 0) = Vi + 2m 
and E[XoYi/2|Li = 0]-a, so that in the end (KP I yields: 



ei/2 



2Vl+2a 



(KV) 



With this value, Theorem 3.3.10 states that one has necessarily 



{X : y} ^ sin(2Arcsinei/2 



[*] 



2ei/2\/l-e 



■1/2 



2a 
l + 2a' 



(KW) 



gikW) 
that 



-k-1/2 



We show that (Km is actually an equality: take indeed f[k]iX) ■= T-i^iXk, resp. 



Yk, then Yarifik]) 



Var(^[^]) 

l)a k^oo 



k{l + 2a) and E[/"^] 
2a 



{2k - l)a, so 



(KX) 



k{l + 2a) l + 2a 

3.5.10 Remark. One can formally set a = +00 in the previous example, which actually 
means that one takes Xj = (yj_i/4 + (Di+1/4, resp. Yj = wj-n^ + w^+1/4. In this case, both 
Formulas (|kv|| and (|kx|) 'pass to the limit', yielding ei/2 = 11^2 and {X : Y) = 1. This 
shows that it is possible indeed that the Bz have 'mild' values and that yet X and Y 



are fully correlated. In other words, the "a?" in (HUl is not an 'artifact' of the proof of 



^*^As here one always has ei/2 ^ l/\/2, we can drop the "a|" of Formula (HUi. 
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Theorem 3.3. lC|[fl but the expression of a real 'phase transition' phenomenoiT^ Such 



[fl 



a phase transition did not occur for the simple tensorization formula ( |FM[ ), which shows 
that double tensorization in intrinsically more complicated than simple tensorization. 



3.5.b Asymptotic optimality 



In the previous subsection we saw that (the minimal versions of) Theorems 3.2.2 



and |3. 3. 10] were optimal, while Theorem |3 . 3 . 1 1 was not. However it turns out that that 
result is nevertheless 'asymptotically optimal', in the sense that the bound it gives is 
equivalent to the optimal bound when the correlations between the variables become 
weak. Here is a precise statement: 

3.5.11 Theorem. Let / = {!,.. .,N} and J = {1, . . . ,M} be finite sets, and define the func- 
tion Opt: [0,1]^^=^ -[0,1] 63/ 



Optiei.j) := sup{{X7 : Yj} ; (V(i,j) e I x J){{Xi : Yj]^- , y , )^ ^iJ^^' 
then, when eixj 0, one has: 

Opt(e)~\\\e\\\. 



(KY) 



(KZ) 



3.5.12 Remark. In the same way, the simple bound (FDI of Proposition 3.2.1 is asymp 



totically equivalent to the optimal bound ( |FM[ ) of Theorem 3.2.2 



Proof. Take {M + NM) i.i.d. jY{1) variables £,i,...,£,M,(^ii,---,(tiNM- For iiaij))ij e 



TtNxM 



, set 



Yj = ^j + Liaij(Oij. 



Denote 



{Xi : Yj}(7: y , V 



(LA) 



(LB) 



and define eij as the Pearson correlation coefficient of Xj and Yj under the law P['| 
X{i'<i],Y{jt<j] = 0]. Then, as in the proof of Theorem 3.5.3 one has eij - \eij\, and the 
function a ^ e is 'io^ around 0, with 



a + Oi\\af) when a — 0. 



M 



(LC) 



By the inverse function theorem, a e is therefore a diffeomorphism from some neigh- 
bourhood y of onto some neighbourhood U of 0, whose inverse function is such that 



a = \/M7+0iUf) whenJ— (). 



(LD) 



Now let e e (K+)^'^^ n U. Take aeV such that e(a) = e, so that the condition of (iKYl) 



is satisfied. For (peW^ ,y/eR^ with \\(p\\, 

fiX) 
g(Y) 



-- 1, set 
Li<PiXi; 



(LE) 



'^^^On the other hand, it is possible that the "a1" in \gd\ was such an artifact, since Theorem 
not optimal. 



3.3.1 



IS 



f*^There exist indeed situations going 'beyond the phase transition', i.e. for which Xzez ArcsinCez) > n/2, 



though this is not the case for Example 3.5.9 
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One has 

Var(/-) = M, (LF) 
Var(^) = 1 + OiWaf) = 1 + Oi\\ef) (LG) 

and 

nfg] = Y.aij(piVfj = {(p,eiff) + Oi\\e\\\ (LH) 

where the constants implicit in the "0(||e||^)" are uniform in (cpjif/). So one has 

Optie) ^{X:Y}^ Jf^/i' , = \((P,£n^)\ + Oi\\ef), (LI) 
Sd(/)Sd(^) 

whence after taking supremum over i(p,y/): 

Optie) ^\\\e\\\+Oi\\efy^'^\\\e\\\. (LJ) 



Since on the other hand Optie) ^ |||£:||| by Theorem 3.3.1, the proposition follows. □ 



3.5.13 Remark. If we state decorrelation hypotheses w.r.t. the whole cr-metalgebra of 
the system (denoted by *), no quantity analogous to etj shall exist any more; then one 
can only write, denoting e' ■ == {Xi : F,}* : 

I J J 



e'--ia)=-^ + Oi\\af). (LK) 



VM 

So, to see how the correlations depend on the parameters, we have to study the map 
a >-* e' , which is approximated by a homothety only on the cone R^'''^ — and which 
moreover is no better than continuous here. So we shall replace the inverse function 
theorem by an alternative technique, which will yield the slightly weaker theorem 
stated just below. 

3.5.14 Theorem. Define 

0pt'{ieij)(ij)euj) ■■= sup{{X/ : Yj} ; (V(i,j) g / x J)i{Xi : Yj}, ^ etj)}; (LL) 
then for any closed cone C ofU^""^ contained in (IR* )^''^ u {0}, on C, one has 

Opt'ie)'^^ \\\E\\\. (LM) 



3.6 Machinery for using the tensorization theorems 

Up to now we stated the tensorization theorems in a rather 'theoretical' form. To apply 
these results to 'concrete' situations, some additional techniques may be needed. This 
section gives such techniques, which we will use later for the applications of Chapter[5] 

In this section, all the probability systems considered will be endowed with their 
natural a-metalgebras, cf. Definition ^. 1.16 To alleviate notation, I will give no names 



to these a-metalgebras, but will plainly denote {X : F}* to mean "the subjective decorre- 
lation between X and Y seen from the natural a-metalgebra of the underlying system". 
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3.6.a The *doubling-up' technique 



3.6.1 Definition. For / a set and ^ a binary relation on I, Ji, J2 I, we will say that 
"J2 is S?-disjoint to Ji" if e Ji x J2 ^ i^J- 

3.6.2 Lemma ('Doubling-up' lemma). Let I he a (countable) set and let (X 1)1^1 be ran- 
dom variables such that for all i,j e /, {Xj : Xy}* ^ £ij for a certain family ofeij € [0, 1]. 

Let M be a binary relation on I; for i,j g /, denote efj ■= l^c^jEij. 

Define I = Ii\±> I2 to be a disjoint union of two copies of I; denote by (ii)iei, 
resp. (j2)jei, the elements of Ii, resp. I2- Assume that the following holds for a cer- 
tain e e [0, 1]; "if{Yi^)i^ei o-re random variables such that \li,j e / {Yi^ : Yjjt, ^ e^, then 
{Yi,:Yi,}^£". 

Then for all Ji, J2 I such that J2 is Si-disjoint to Ji, {Xj^ : Xj^}* ^ e. 



3.6.3 Remark. The interest of Lemma 3.6.2 is that, by proving one tensorization result 



on {Yj^ : YiJ, one gets tensorization results on all the {Xj^ : Xj^} for J2 5?-disjoint to Ji. 
3.6.4 Example. 



1. If you take for the equality relation, then Lemma 3.6.2 gives a decorrelation 
result for all disjoint Ji and J2- 

2. If / is equipped with a distance dist and if you take (iSS j) o {distii,j) < di), then 
you get a decorrelation result for all Ji and J2 such that dist{Ji,J2)^ di. 

Proof Assume that the hypotheses of the lemma hold and let Ji, J2 I with J2 ^- 
disjoint to Ji. For g I, define 

\Xi if (k=1 and ieJi) or (k = 2 and i g J2); ^^^^^ 
Yi^ = < (LN) 
[ d otherwise, 

for d some cemetery point in the range of none of the Xi. Since a constant variable 
is always independent of any variable, the hypothesis "{Xi : Xj}^ ^ Sij" for all i,j e / 
implies that {Yi-^ : Yj^}^, ^ e^, so, by the assumption of the lemma, {Yj^ : Yj^} ^ e. But 

Xj^ is Y/^ -measurable, resp. Xj^ is -measurable, hence {Xj^ :XjJ ^ e. 



Getting the subjective result w.r.t. * is just a variant of that reasoning, cf § |3.4.b 

□ 



3.6.b A practical result on Z'^ 

In Chapter [5| the situations we will handle shall always be of the following form: 

3.6.5 Assumption. For some n € N* , the system is made of random variables Xi, 
i G Z", which satisfy the condition 

yiJeZ" {Xi:Xj},^eij-i) (LO) 

for some symmetric function e: Z"' ^ [0, 1]. 



73 



For systems satisfying Assumption 3.6.5 one has the following practical synthetic 
result: 



3.6.6 Lemma. Consider a norm ]•] on IR", the associated distance on the affine IR" being 
denoted by dist. Then for a system satisfying Assumption \3.6.5\ for all Ji,J2<^ I: 



{Xj,:XjJ^(^e(2))Al. 

\z\>dist(Ji,J2) 



(LP) 



Proof. To alleviate notation, denote d ■= dist{Ji,J2)- Applying Lemma |3.6.2[ tak- 
ing for the relation "be at distance < d" (cf Example |3.6.4 2 1, our goal becomes 
the following: supposing {yi^)i^ezi^*}Zl are random variables such that {Fj^ : Yj2\* ^ 
je(j - i), we want to bound above {Yjn : Yjn}. 



To do this we apply Theorem 3.3.1 
where e is the following operator: 



and we get that {Yjn ; Y^n} is bounded by |||£||| a 1, 



e: 



To compute |||e|||, we split e as Zzez 1|2| ^ d^iz)Mz, where is the operator 

M,: LHZ) O 

igij))jeZ ^ igii + z))iez- 



..d£U-i)gU))iEZ- 



(LQ) 



(LR) 



Obviously IHM^IH = 1, thus |||e||| ^ — actually there is even equality — , which 



ends the proof of Lemma 3.6.6 



□ 



3.6.7 Remark. Instead of Theorem 3.3.1 here we could have used Theorem 3.3.18 



which would yield a better result; yet that would be very specific to (cf § |3.6.d| ), and 
the result would actually be almost equivalent to ([LP|> (cf §|3.5.b[>. 



3.6.C Avoiding the artificial phase transition 



Let us look again at Formula ( |LP[ ): the "a1" in it is not really relevant since a cor- 
relation level is always bounded by 1. In fact the situation is dichotomic: denoting 
d ■■= distil, J), either Z|z|&d£(2) is < 1 and then (LPl is a true decorrelation result, or 
it is ^ 1 and then (LPl tells us actually nothing. In other words, our result has a 
'phase transition' depending on the relative values of X|2|s<i^(2) ^i^d 1, similar to the 



phenomenon we discussed in Remark 3.5.10 



However, as I pointed out in Footnote [tl on page 71, it is not clear whether the 



phase transition we are dealing with is a real phenomenon: maybe it is rather an 



artifact due to Theorem 3.3.1s bound's being non-optimal, which could be avoided 



by a cleverer reasoning. We are strengthened in that thought by observing that, if 
T.zez'^^iz) < then for d large enough one has T-izi^d ^(d) < 1, so that there is no 
phase transition for long distances; why would a transition appear all of a sudden for 
short distances? 

This subsection will show that, indeed, phase transitions can be avoided in the 
situations we deal with. 
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3.6.8 Lemma. For a system satisfying Assumption \3. 6^ with e(z) < 1 as soon as z ^0 
and T^^oe(z) < oo, there exists a constant k <1 such that, for all disjoint Ji,J2 1, one 
has [Xj^ ■.Xj^}^k. 



Proof. As before, using Lemma 



3.6.2 



we have to bound above {Yjn ; Yjn} in the relevant 
doubled-up model. Our plan to avoid the phase transition will consist in reducing to 
the 'long distance' case. 

For some Z g N*, we split Z", resp. T'^, into a partition of =: A'" sublattices Z^^\ 
...,Z^^\ resp. Z^^\...,Z^^\ each lattice Z^"^ being of the form IZ"- + Zu for some z^ g 
^IJT' . I claim two fundamental properties of these sublattices: 

3.6.9 Claim. For all u,v ^ {!,..., N\, 



(LS) 



Z—Z\) Zu 



Proof. It is analogous to the proof of Lemma 3.6.6 



□ 



3.6.10 Claim. Provided I is large enough, the right-hand side o/ffl^ is (strictly) less 
than 1 for all the possible values ofu,v. 



Proof Denote ( ■■= sup^^o^^^^! notice that our assumptions imply that ( < 1. Since 
Zzez"£(2) converges, there exists some di < oo such that Lizixfi^C^) < 1 - C- Now, 
denoting do ■= min{|2| : ^ G Z" {0}}, for I > 2d\ -h do, for all u, v there is at most one z 
congruent to z^-Zu [mod. I] such that 1^1 ^ di, whence the following uniform bound 
for the right-hand side of ( |LS| ): 

^ l^^QEiz)^ ^ E{z)+ Y. ^(2) ^ E ^(2)+C<l. (LT) 

z=z„-Zjj |2|><ii \z\^di \z\>di 



\z\^di 

Z — Zi) Zii 



<l-C 



because 
the sum has at most 
one term, being S f 



□ 



Now, suppose I l arge enough so that Claim |3.6.10| works. We apply simple ten 



sorization (Theorem 
for any u G {1, . . .,N} ', 



3.2.2 1 to the Y^(vy. writing that Yjn = (F^cd, . . . , Y^im), we get that. 



{Y-,(u) : Yz^}, 



(LU) 



Now we write Yjn - (Yja),. ■ ■ ,Yj(N)) and we apply simple tensorization again — this 



time to the Y-^m — to get: 
^1 



{Yjn : Yjn} ^ 1 - (l - ; Yjnjlj < 1. 

\ u=l ^ 



Bound ( LV I achieves our goal. 



(LV) 

□ 
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3.6.11 Remark. With that proof, the way k depends on ei-) is rather comphcated; in 
particular, you cannot express ^ as a function of only Zz^o^(^) sup^^^o^^^)- 

3.6.12 Remark. In the case ?i = 1, at first sight Lemma 3.6.8 seems to contradict The- 
orem in which we told that Theorem |3. 3. 10[ which does have a phase transition, 
was optimal. The explanation for this paradox stands in the slight difference between 
the assumptions of Lemma 3.6.8 and Theorem 3.3.10 while in Lemma 3.6.8 we really 
imposed that {Xj :Xj}^ ^ E{d(i,j)), with "*" denoting the full natural cr-metalgebra of 

-more precisely, in the version of Theorem |3.3. l^The- 



the system, in Theorem |3. 3. 10 
orem 



3.5.3 proved to be optimal, which was the minimal version of this theorem (cf 



§ |3.4.a[ ) — the conditions on subjective decorrelations were a bit looser. That difference 
makes all the tr ick when one performs the steps of simple tensorization in the proof 
of Lemma 



3.6.8 



because these steps require subjective decorrelations w.r.t. the YxCu), 



which the sole assumptions of Theorem 3.5.3 do not provide. 



3.6.d Non-flat geometries 

It is natural to ask what we one can do when the basic variables Xi are not indexed 
by Z", but by the vertices of a more general graph, for instance a tree or a finitely 
generated group. This shall occur indeed if the physical space one works in exhibits 
some curvature — though Chapter |5] will not handle such situations. 



Actually for general graphs there are results analogous to those of §§ 3.6.b 



and 3.6.c[ with similar (though more technical) proofs II35II . Here I will only give the 



statements of these results. 

In this subsection the situation will be the following: 

3.6.13 Assumption. The system is made of random variables (Xi) indexed by a (count- 
able) set /. There is a group G acting transitively on /, and / is endowed with a 
symmetric map d: I xl ^D, called the 'abstract distance', which is preserved by the 
action of G. We assume that one has 

yijel {Xi:Xj},^eidii,j)) (LW) 

for some function e: 2) ^ [0, 1]. 

3.6.14 Definition. For d g 2), we define val(d) ■■-#{i el : d(o, i) = d}, where the choice 
of o e / does not matter. 



Then the analogous to Lemma 3.6.6 is the 



3.6.15 Lemma. For 2)' c 2), for all Ji, J2 <= I such that (i e g J2) ^HJ) ^ 2)', 

{Xj, : XjJ ^ ( E val(d)£(d)) A 1. (LX) 

deTi' 



The analogous of Lemma 3.6.8 is the 



3.6.16 Lemma. Assume that Assumption 3.6.13 is satisfied; denoting by the (com 



mon) value of the d(i, i), also assume that val(O) = 1 and that aid) <1 as soon as d ^0. 
Assume that Xdes val((i)£((i) < 00. 
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Moreover, assume that the action ofG on I is profinite (cf. I^5\ Definition 1.1]), i.e. 
that there is a subset jy (zN* such that for each N g Jt^, there is a subgroup 
such that: 

(i) The action ofGjsi splits I into exactly N orbits . . 

(ii) Gn is normal, so that the partition of I into the /'"^ is stable by the action ofG; 
(Hi) Any two distinct points of I are ultimately separated by the partitions induced by 

the Gn, i.e.: 

ii^ (Gat •o) = {o}. (LY) 

N^oo 

Then there exists a constant k < 1 such that, for all disjoint Ji,J2 I, one has 

3.6.17 Example. For 7 = Z" on which G = Z" acts by translation, equipped with the 
abstract distance '0(x,y) 



are 



{±(y-x)}, the assumptions of Lemmas |3 . 6 . 1 5] and |3 . 6 .16 
checked, and these lemmas re-give resp. Lemmas 3.6.6 and 3.6.8[ 
3.6.18 Example. For I the modular group PSL2(Z) acting by left multiplication on it- 
self, equipped with its natural abstract distance (i.e., Vii,j) = {i~^j,j~^i}), the assump- 
tions of Lemmas 13.6.151 and 13.6.161 are also checked — to see that the action of G on 7 
is profinite, take for the Giv(/) the principal congruence subgroups T(l) of the modular 
group [36.1 . Notice thatPSL2(Z) is an example of graph having negative curvature 1,181 . 



3.7 Appendix: Illustration of the proof of Theo- 
rem |3XE 

This subsection is devised for the readers who would like to understand better the 



proof of Theorem 3.3.1 by seeing how it works on a concrete example. It only contains 



pedagogical material, and thus can be skipped safely. 



3.7.a A Gaussian system of variables 

In this illustration we take N = 2,M = 1 — since M = l,Yi will merely be denoted by Y 
— , and we take {Xi,X2,Y) Gaussian (and centered), whose law is described through a 
3x3 matrix via writing that, for some standard Gaussian vector (^1,^2,^3) ^ H^^, 



(LZ) 







' a\ a2 oc^^ 




{^l\ 


X2 




Pi P2 h 






[y) 











We denote the matrix appearing in ( LZ I by M. The rows of M will be denoted by a, )6, 7 e 
R^, and {£,\,£,2,^^) will be denoted by e IR^. On IR^ we will use the Euclidian scalar 
product "•" and the associated norm 

The advantage of this model is that, by of the general properties of Gaussian vectors 



(in particular Theorem 1.2.6 ), all the quantities of interest are computable exactly. 

First we compute the correlation levels: by Theorem 1.2.6[ 

la • wl 

{XirF}^ „' „„ '„ , (MA) 
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Figure 1: Visual definitions of the vectors derived from a, fi and co: the left drawing shows 
how to build p^,(jD^,f5* ,cd*; the right drawing (with different values for a,fi,(D) explains the 
construction of a, fi, a, p, fi, p^. 



similarly {X2 :Y} = 1/5 -aj]^ \\a)\\; and 



{X:Y} = Jl-^-^ — i-^r, (MB) 
V ||a>||2||ax^||2' 

where "x" denotes the cross product on U^. Concerning the conditional quantities, de- 
note by p^, resp. w^, the (orthogonal) projections of ^6, resp. o), on Ua, and jS*, resp. co*, 
the projections of the same vectors on (Ka)-*-, i.e. (assuming that a ^ 0): 

I3^:=il3-a^\\af)a, co^ ■■= (w- \\af)a; (MC) 



P*:=I3-P^, co*:=a)-a)^ (MD) 

(see Figure Then one has ELXglXi] = ^}^i + ^^.^2 + i6^'^3 = i^^-^, resp. E[r|Xi] = w^-^, 
thus X2-E[X2|Xi] = P*-^, resp. Y -E[Y\Xi] = oj*-^. As {Xi,X2,Y) is Gaussian, the 
law of (X2-E[X2lXi],y-E[y|Xi]) under P[-|Xi = x] does not depend on the value 
of x; therefore we know all the conditional laws of {X2,Y) under the P['|-X^i = x\, and 
for all these laws {X2 : Y} is equal by Theorem [r2^ to |)6* • w*K II ||, so in the 
end: 

{X2:Y)x, = .l^*'.fl.. . (ME) 

11/3*11 ||tL»*|| 

[Xi : Y}x2 can be computed by a similar formula. 



Now let us 'dissect' the proof of Theorem 3.3.1 on our example. We take f linear, 
namely 

fiXi,X2)--=Xi+X2, (MF) 
so that all the computations shall again be tractable exactly. 

Let us start with computing the quantities linked to /"°: one has 

f^ = f = (a + )6)-^; (MG) 
f0 = f<yix,) ^ Xi + (X2r^^'^^ia + p^)-^; (MH) 
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whence respectively 



y = yO = \\a + l5f = \\af + \\pf + 2a-l5; (MJ) 
V° = \\a + p^f = \\af + 2a-p+—^; (MK) 

= \\P*f = \\l5f-^-^. (ML) 



By the way we check that, as claimed by Formula (g, y° = y° + V^. 

Now we turn to the quantities linked to f^. First we have to compute the con- 
ditional laws of iXi,X2) under the events "Y = y". The technique is the same as for 
computing {X2 : Y}xi a few lines above: denoting by a, resp. by p, the projections of a, 
resp. )6, on Uo), and a, resp. /3, the projections of the same vectors on (IRo)-'-), i.e. (see 
Figure [l]) 

a--=ia-(D^\\wf)ci), ^■.= il5-co^\\ci)f)(D; (MM) 



a--=a-a, ^ -.= 13-13, (MN) 

one has E[Xi|r] = a • ^, resp. E[Z2|F] = p-^, thus Xi-E[Xi\Y] = a • ^, resp. X2 - 
E[X2|y] = and (Xi-E[Xi|r],X2-E[X2|F]) has the same law under all the P[-|l^ = 
y]. So we can compute the quantities linked to in the same way as we computed 
those linked to /"": denoting 

)6:=f3^a; (MO) 
\\aV 



(see Figure [T]), one finds 



whence respectively: 



I3'^:=P-I3 (MP) 

= ia + p)-^; (MQ) 
fl = (5 + ;6)-^; (MR) 
f2 = 0^-^, (MS) 

= \\a + pf; (MT) 
= \\a + pf; (MU) 
Vi = WP^f. (MV) 

As for f^, we check that V-^ = V^ + Vj^, since a + /3 is the orthogonal sum of a + j6 and (3^. 
Moreover one always has ^ resp. ^ V^: the first inequality follows indeed 
from (a + j8)'s being the projection of ia + (3) on (Uo))-^, and the second one from (3^'s 
being the projection of 13* on ([Ra» + IRa)"*". These inequalities are consistent with the 



following corollary of Claim 3.3.4 obtained by applying the claim conditionally to '^j-i 
with the role of"/"" played by f-^~^ and the role of "F" played by Yj: 

3.7.1 Proposition. For all 1 ^ j ^ M, all ^ i ^ N, 

i'>i i'>i 
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3.7.b Numerical computations 



Now let us see a numerical example. Our parameters will be chosen so that the func- 



tion f defined b y ([MF| ) is optimal in the supremum ( |AG[ ) defining the Hilbertian correla- 
tion coefficient {X :Y); other than that, the behaviour of our example will be generic: 



1 



1 
4 
1 



1\ 
1 



(MX) 



VI 1 A) 

For that M the calculations of the previous subsection give: 



X 


a 




r 










0)* 


a 


)6 


a 




P 




Xi 


4 


1 


1 


-3 


2 


2 


-1 


-1 


1/2 


1/2 


7/2 


1/2 


7/6 


-2/3 


X2 


1 


4 


1 


-3 


1/2 


1/2 


7/2 


1/2 


1/2 


1/2 


1/2 


7/2 


1/6 


10/3 


13 


1 


1 


4 


15 


1/2 


1/2 


1/2 


7/2 


2 


2 


-1 


-1 


-1/3 


-2/3 



whence {Xi : Y} = 1/2 and {Xi : Y}x2 = 1/3, thus {Xi : Y}^ = 1/2; and similarly {X2 
Y}m = 1/2. 



Then Theorem 3.2.2 yields 



{X:Y}^l/^ = 0.707..., 



and even, according to the refinements of § 3. 4. a 



{X:Y}^V 13/6 = 0.600...; 
on the other hand, the true result is: 

{Z:Y} = l/^ = 0.577.... 
So here the bound (|GD|) is (fortunately!) correct, and even rather sharp. 



(MY) 



(MZ) 



(NA) 



Now, as the proof of Theorem |3.3.1| consists in studying the relations between 
the W, let us see what these quantities look like here. One computes: 



y y « 



13, 



[ 24 12 



3.7.1 



(NB) 



12 ^ 



As a first consequence, we can check the conclusions of Proposition 

= 1 3^, re sp. Vl + = 36 ^ + = 54. Going further, we check the conclusions of 
Claim |3.3.5[ which forbids the differences V° - and (V^ + V°) - (V^ + V^) to be too 
large: for the first difference, one has -V} = ll ^ elVS = 3f^ 



[§] 



and for the second 



one, (yo + V^) - (Vl + V^) = 18 ^ (eiV^ + JV^ - V^f = 19.419 . . . . 



'^According to § 



3.4.a 



one can replace £2 - 1/2 by {X2 : Y}xi - 1/3 in this inequality. Then the in- 



equality even becomes an equality: this is linked to the optimality of certain tensorization results for 



Gaussian variables, cf § 3.5 
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3.7.C Some traps to avoid 



To finish with this appendix, I would like to co mment on what is true or not about 



the V in general situations. Proposition 



3.7.1 



pointed out that for all le {0,...,N}, 



T.i>7V^ is a nonincreasing function of j; in particular, when one looks at the table of 
the , the last term (i = N - 1), resp. the total (i - 0) of line j can only decrease. 
Moreover, if in some line j all the are zero from some position i + 1, then this 
property remains true in all the lower lines j' > j. That can be explained very simply, 
since saying that all the are zero from position i + 1 means indeed that f is ("J^y v^^)- 
measurable, hence a fortiori C^f v ^j)-measurable. The following example, in which f 
turns out to be 2Xi, illustrates this phenomenon: 



M 



fl 0^ 
10 
Vl l) 



'4 
2 



(NO 



However, keep careful: almost anything else you would like to say about the table 
of the W would be false! In particular, for i < N, V- is not a nonincreasing function 

of j in general; it is not even true that = ^ W ^'^ = 0, as shown by the following 
example: 

1 \ 
1 



-1 



1 1 





1 ; 



^2 
^2 



' 



1 

1/2 



(ND) 



It is not true either that, if V-' remains unchanged from one line to another (that is, 
the total of the remains unchanged), then all the V- are unchanged. In fact, that 
is equal to means that, conditionally to "^j, f-' is centered w.r.t. Yj+i, and then 
f-'^^ = f-'. However, the way f-'^^ decomposes into a sum of ff^^ may be different to 

the way /"-' decomposed into a sum of because conditioning w.r.t. Yj+i may make 
the law of the Xj change! That is what happens in the following example: 



M: 



(1 1\ 

1-1 
1; 



a/2 3/2 

1 1 



(NE) 



3.8 Appendix: A corollary of the Perron-Frobenius 
theorem 



In this appendix I handle a lemma used in the proof of Theorem |3.3.1] We are working 
on the vector space for some AT > 0; a vector or a matrix is said to be > if all 
its entries are positive, resp. 3^ if all its entries are nonnegative. Then the Perron- 
Frobenius theorem II23|, Theorem 8.3.1] states that if a square matrix A is 3^ 0, then A 
has some ^ eigenvector for the eigenvalue p(A). Our goal here is prove the following 
corollary: 

3.8.1 Lemma. Let A^O be a square matrix, then: 

inf{A^O: (3u > 0)(Au ^ Au)} = p(A). (NF) 
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Proof. We prove separately each way of the equahty Let us begin with way Let v ^ 
be some eigenvector of A for the eigenvalue p(A). If i; > 0, then the value A = p(A) 
checks the condition in the infimum and we are done. Otherwise if i; ^ 0, up to a 
permutation of indices it has the form {Q,...,Q,v'^^^,...,v'j^) with < n < N and all 
the v'- positive. Reasoning by induction, assume that we have proved the way of 
the lemma for all n <N. Then the form of the eigenvector v forces A to write blockwise 



( A 

* * 



(NG) 



with U"'^"' 3 A ^ 0. I claim that p(A) ^ p(A), since if v is an eigenvector of A for the 
eigenvalue p(A), then for t^O 

A^{d, 0, . . . , 0) = {piAY U, *,...,*), (NH) 



so 



lim p(Ar* |A*(iJ,0,..., 0)1 >C™ 



(NI) 



and consequently p(A) ^ p(A). Now let £ > 0. By induction hypothesis there exists 
some R"-3w>0 such that Aw ^ (p(A) + £)w. Thus for r]>0,U^ 3 (rjiv, v') > and 

Air]w,v') = (r]Aw,piA)v' + 0{r])) ^ (r]{p{A) + £)w, p{A)v' + 0{r])) ''i" (p(A) + E){r]w, v'). 

(NJ) 

So (p(A) + e) checks the condition in the right-hand side of the infimum, which ends 



the proof of the way of (NFl. 



For the way consider any IR^ 3 u> and let again i; ^ be some eigenvector 
of A for the eigenvalue p(A). Then there exists a (unique) ^6 ^ such that u - (iv ^ 
but u- Pv ^0. For this )6, one of the entries of fiv and u is the same, say (ivi^ = Ui^. So 
if A <p(A), 

Xui, < piA)ui, = p(A)pvi, = iA(pv))i^ ^ + (^(" - = (Au)i„ (NK) 

thus Au ^ Au. That relation being true for any u > 0, A does not check the condition in 



the infimum, which proves the way "3^" of ( NF I. □ 



3.9 Appendix: A geometric consequence of results 
on correlations 



As I pointed out in Remark 3.5.6, for Gaussian vectors Hilbertian correlations can 



be interpreted in terms of Euclidian spaces. In this appendix I will present a funny 



corollary of Lemma |3 . 5 . 5 1 following from this interpretation. Actually that result itself 
is more or less a pretext: the real goal of this appendix is in fact to show in an eloquent 
way the geometric meaning of maximal correlations and the Hilbertian frame that 
underlies them. 

First we need some vocabulary about Euclidian spaces: 



'^'"^Equation (NI i is meaningless if p(A) - 0, but in this case there is nothing to prove. 
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3.9.1 Definition. 

1. For Li,L2 two vector lines in the Euclidian space K^, or more generally in any 
Hilbert space, we caW geometric angle between Li andL2, denoted by L1L2, their 
"angle" in the elementary sense: for arbitrary a e L 1 {0}, & G L2 {0}, 

— \a-b\ 

LiL2 = Arccos- =^ e[0,n/2]. (NL) 

Hall II6II 

2. For L\,L2 and L3 i^h\,L,2 three vector lines in the Euclidian space (or any 
Hilbert space), we call apparent angle between L\ and L2 seen from L3 the ge- 
ometric angle that an observer located somewhere on L3 {0} would have the 
impression, due to perspective, thatLi andL2 make (see Figure [2]l: technically, 
it is the geometric angle L'^L'^, where L'^ and L'^ are the respective orthogonal 
projections of Li and L2 onto the plane (La)-*-. 



Then one has the following corollary of Lemma 3.5.5 



3.9.2 Theorem. Let Li,L2,L^ he three distinct vector lines of K^. Denote A ■■- 
L2Ls,B ■■- LaLi,^ := L1L2, and denote by A' the apparent angle between L2 and L3 
seen from Li, resp. B' the apparent angle between L3 and Li seen from L2, etc.. Then 
the relative order of A and A' is the same as the relative order ofB and B' and as the 
relative order ofD. and O', i.e., "A' < A" (resp. "A' = A", resp. "A' > A") is equivalent to 
"B' < B" (resp. "B' = B", resp. "B' > B"), etc.. 



3.9.3 Remark. I found Theorem 3.9.2 by chance, one day that I was looking for a sit- 



uation where one would have B' > B but A' <A, in order to build a 'nice' example for 



§ 3.7.b I thought that such a situation would be generic, but after having looked for 



it without success, I realized that it was actually impossible, and that the explanation 
had a simple interpretation in terms of correlations. 



Proof Fix three arbitrary nonzero vectors a, 15,0) of resp. L\,L2,L^; and consider the 
Gaussian system ( |LZ| ) of § |3.7| for these vectors. Then the correlation coefficients be- 
tween Xi, X2 and Y can be interpreted as angles between Li, L2 and L3; more pre- 
cisely, one has the following correspon dance: 

3.9.4 Proposition. 

(i) {Xi : X2} is the cosine of the geometric angle between L\ and L2; 

(ii) {X\ : X2}y is the cosine of the apparent angle between L\ and L2 seen from L3. 



Proof ^ is nothing but the Euclidian interpretation of Theorem 1.2.6[ ^ follows from 
the fact that, in the vector space spanned by jointly Gaussian real random variables, 
conditional expectation corresponds to orthogonal projection and independence corre- 
sponds to orthogonality. 

□ 



By Proprosition 3.9.4 in our situation Lemma 3.5.5 gives: 



{{Xi,X2) : r} = V 1 - sin^E sin^ A'. 



(NM) 
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Figure 2: This figure shows four different views of the same 3-dimensional object. What inter- 
ests us actually is only the three concurrent lines Li,L 2,-^3, but we added a die centered at 
their point of concurrency to see depth better on the pictures [Recall that on a die, the total number 
of points on two opposite faces is always 7.]. On the top picture, the die is shown in generic position. 
We represent the angles A, B and H; these angles are 3-dimensional angles, which we under- 
line by drawing them with double strokes. On each of the bottom pictures, the die is viewed 
from the direction of one of the lines (from the left to the right, L3, Li and L2), so that this 
line appears completely foreshortened. We represent the angles D.', A' and B' made by the two 
other lines as they appear on the drawing; we underline that these angles are 2-dimensional 
by drawing them with simple strokes [Note that in the case of B', the angular sector representing B' 
is not the projection of the angular sector representing B, but its supplementary — otherwise B' would 
be greater than 7t/2, which would contradict our 'geometric' definition of angles.]. On this example 
one has A ^ 58°, B ^ 71°, 17 ^ 15° and A' ^ 30°,B' ^ 34°, n' ^ 9°; so, perspective makes angles 
appear smaller than they are really for all three pairs of lines, which is in accordance with 
Theorem [SXH 
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Obviously the roles of Xi and X2 can be interchanged in the above argument, yielding: 



{(X2,Xi):Y} = Vl-sin2 A sinks'. (NN) 
But iXi,X2) and iX2,Xi) generate the same a-algebra, so {(Xi,X2) :¥} = {iX2,Xi) :¥}, 



and thus, comparing (NMI and (NNI: 



sinA^ sinB' 



sinA sinS 



(NO) 



This implies in particular that sinA,sinA^ and sin5,sin5' have the same relative 
order, so also do A, A' and B,B'. A cyclic permutation of Li, L2 and L3 shows that the 
result is still valid for O'. □ 



3.9.5 Example. See Figure [2] on page [84 
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Chapter 4 

Other applications of tensorization 
techniques 

In the previous chapter we have been seeing how Hilbertian decorrelation hypotheses 
between pairs of variables could yield 'global' results on an arbitrary number of vari- 
ables, by splitting functions of several variables into relevant telescopic sums. I used 
the word "tensorization" to qualify these results, as the conclusions were of the same 
nature as the hypotheses. 

But the techniques of § [3] can also be applied to get other tj^pes of results. In this 
chapter I am going to show how, from Hilbertian decorrelation hjrpotheses, one can 
get results on some classical features of particle systems which are not linked with 
Hilbertian correlations a priori. 

I will deal with two such features. First, I will look at the implications of p-mixing 
on the existence of a central limit theorem — more precisely, of a spatial central limit 
theorem, since I am more interested in random fields than in sequences (variables 
indexed by Z" rather than by Z). Very sharp results concerning this issue are already 
known; however, I find interesting to show how it goes with my 'tensorization-like' 
approach: this approach takes indeed a quite different way to do the job, which may 
be neater by certain sides. Moreover, the results are stated with a slighlty different 
vocabulary — namely, subjective Hilbertian correlations. 

Next, I will look at the question of spectral gap for Glauber dynamics. Though this 
point has already been thouroughly studied in a jS-mixing paradigm, this work, to the 
best of my knowledge, is the first to show how p-mixing can be used to tackle this issue. 

My main goal here is just to show how the techniques of this work may be applied 
to the problems of spatial central limit theorem and convergence of the Glauber dy- 
namics. Accordingly, I favoured the simplicity on proofs against the refinement of the 
results. 
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4.1 Spatial central limit theorem 



4.1. a Introduction 

A fundamental result in probability theory is the central limit theorem (CLT), which, 
in its standard statement, requires an assumption of complete independence. It is 
natural to wonder whether that assumption can be relaxed into an hypothesis of 'near 
independence'. Hilbertian decorrelations are a natural frame for such a generalization, 
since the CLT already takes place in an setting. 

Our point of view is motivated by statistical physics. Let Z" be a lattice, on each 
vertex i of which there is a random 'spin' Xi ranged in some space 3C not depending 
on i. We assume that the law of the system is translation invariant, i.e. that for all z g 
Z"^, (Xj+2)iez" has the same law as X^n. Then, for all z g Z", we denote 

£, = {Xi:Xi+,},. (NP) 

We are interested in situations where the Sz are sufficiently 'rapidly decreasing' as 
|2| ^ oo so that LzeZ" < oo. 

Let f: ^ ^ IR be a function such that f(Xo) is square-integrable and centered. The 
question is, does one get a CLT when summing f(Xi) for i in a large subset of Z", i.e., 
does the sum grow as the square root of the number of its terms and have asymptoti- 
cally normal distribution? For instance, we would like the law of the variable 

0^ii,...,in<l 

to weakly converge, when Z ^ oo, to some Gaussian distribution. 

4.1.1 Remark. Note that the limit distribution, if it exists, will have to be centered, but 
its variance will not be equal to Var(/"(Xo)) in general. 

In the case n = 1, extremely sharp results for this topic have been known from long; 
let us cite, among many others, [38, 24, 531 [HI. For n^2, similar results also exist; see 
e.g. im Theorem 5] for such a result, and [81 § 29] for a survey of the topic. All these 
proofs relie on some 'coupling' between (bunches of) the spins and other convenient 
variables which are close to them, but which are actually independent, so as to deduce 
the CLT for the former from the CLT for the latter. On the other hand, my proof will 
mimick Levy's proof of the CLT, hence needing no coupling argument. 

A priori the results presented here do not improve the state of the art; however, 
when turning to quantitative versions of these results, it is likely that the difference 
between the usual method and mine would yield a difference in the corresponding 
non-asymptotic bounds obtained. 

4.1.b Product of weakly coupled variables 

My results relie on the following 
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4.1.2 Lemma. Let N ^ 1 and let ^i,...,^n be a-algebras with {^i : ^ £i}*^ and 
denote 

£"=sup^f:jy. (NR) 

Let Oi,...,OAr he complex-valued random variables with ^ 1 a.s., such that is 
^i-measurahle for all i, with all the Oj having the same distribution. Then, denoting 
by (p the common value of the E[OJ, 

\E[l\^i']-(p^\^NE{l + e}il-\(p\h (NS) 

i 

Proof. Denote 5 -.= Sd(0). Since E[|0^|] ^ 1, the definition of (complex) variance en- 
sures that 5 ^ y/l - \(p\^. 

For all i G {0, . . . , A'^}, denote ■■= Wi'^i #j ; denote := Ui'^i ^i'] define 

vj/(0 (vi/(f))-^7 _E[^<^'|^,-_i] (NT) 

J •' 

and denote A^^^ == Sdi^>f). Also denote 

^i,j--^{^if'-n.^i\3Pj-i\. (NU) 

Usual manipulation on conditioning shows that, for i ^ 1, 

^ + ^J/^^-"(D._^.. (NV) 

Since IIOjIIlco ^ 1, one has also IKO^)"^^ IIloo ^ 1, hence 

Sd(x}/a-i)(jj,.)^,) ^ Sd(¥'"") = (NW) 

Similarly, it is obvious that ||^*'~"'^^||loo ^ 1, whence 

Sd(¥^-"Oij)^Sd(cDij). (NX) 

Now, I claim that 

4.1.3 Claim. 

Sd(Oi j) ^ eij5. (NY) 



Proof of Claim [073] Conditionally to ^j-i, ^ij is indeed the projection on of the 
centered -measurable function ($i - E[Oj|^y_i]), whose standard deviation is less 
than Sd(Oi) = 5 by associativity of the variance, so that Sd(Oij) ^ {^i : 3Pj}g;^_^5 ^ Eij5. 

n 



'^*^As in § 3.6 "*" stands for "the natural ff-metalgebra of the system". Moreover, in the same way as 
in § |3.4.a| it is actually possible in the statement of the lemma to replace that a-metalgebra by smaller 
ones. 
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In the end, we got that 



Since A° - 0, one has therefore: 



A^^Aj~^^ + eij6. (NZ) 



Vi,j A}^(l + e)(5. (OA) 



Now, denoting == E[^^'^], one has 



= I ^E[^y-"Oij]| ^ ^ Ay-^^Sd(Oij)^e(l + e)(5^ (OB) 



and finally 



iV iV 



which is (NS I if you recall that 5'^ ^l-\(pr. □ 



4.1.C A spatial CLT 

First I state and prove a CLT on cubes: 

4.1.4 Theorem. Consider a translation-invariant spin model on a lattice Z" and de- 
fine Ez by (nPI . Assume that Zzez" < oo- Then for any centered square-summahle 
function f : ^ ^R, there exists a constant a <oo such that 

^ E fiXi)'^^ia\ (OD) 

VP" ill- 

0^il,...,in<l 

where denotes convergence in law. 



Proof Denote by F{1) — or merely F — the left-hand side of ( |0D| ). 
What will be the value of cr? Clearly we must have 

(7^= limVar(F(Z)), (OE) 

which yields 



(7= /EE[/(Xo)/(X,)], (OF) 

V zeZ 

where the expression under the root sign, which is necessarily nonnegative, is finite 
because \TS.\_f{Xo)f{XM ^ E^Sdi{f{Xo))S6.{f{X^)) = e^Varif). By the way we will de- 
note 

z 

Fix some arbitrary 77 > 0. The assumption that L £2 < 00 implies the existence of an 
Zo < 00 such that 

E ^z^r], (OH) 

\z\oo>lo 
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where l^loo denotes max(|0i|,...,|2„|). By ( fOE) ), we can also fix an Zi < oo such that 

\Yar{F{l))-a^\^T]. 



(01) 



Now we will 'tile' the cube of size I into a 'patchwork' made of cubes of size li which 
I call "tiles", each tile being at distance at least Iq from the others, plus some "scrap". 
I denote by F the part of F due to the tiles and hy F* the part of F due to the scrap. 

Index the tiles by {1, . . .,N}, with N ■- [il + lo)^ili + Zo)J We write, with obvious 
notation, F =: Fi + • • • +Fn- For A e IR, denote 



¥(A,Z):=exp(iAF), 
(D^(A,Z):=exp(iAFj), 



(^(A,Z):=E[0,(A,Z)]. 



Then we are exactly in situation of applying Lemma [4.1.2[ which yields: 

I V^(A, I) - (pa,lf\ ^ Nijil + T])il - \ipiA, it). 



(OJ) 
(OK) 



(OL) 



Let us look at the asymptotics of Formula ( [OL| l when I ^ oo. We observe that, 
denoting 



E fiXi), 



(OM) 



one has 



ic fixed tile 



(ON) 



Since Ft is centered, its Fourier transform satisfies Ft(0) = 1, F^(0) = and F" - 
Var(Ft), so that 

limZ"(l-(^(A,Z)) = — Z^Var(Ft), (00) 

Z— >oo A 

where, denoting cr? := Var(Ft), we recall that l\ has been taken sufficiently large so 

^ 1 

that |cr? -cr^l ^ 77. Then, since one has the following asymptotics 

1 



for (OLI: 



(pi A, I) -* exp 



2" 



(OP) 
(OQ) 



It remains to control the contribution of F* 
4.1.5 Claim. There are at most 



1 



I 



1 



h + lo 



r+nhi 



n-l 



(OR) 



scrap spins. 
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By Claim [4X5 



\\F* hi ^ Var(F*) ^ [l - {-r\-f + n-^ 

then the contribution o^F* is controlled using the following immediate 

4.1.6 Lemma. Let X and H be real random variables with Il^i < oo. Then, for XeR, 

\E[e'^^^+"^]-E[e'^^]\^\X\\\H\\Li. (OT) 
In the end, putting everything together we get: 

\i^\yriX,l)-e-'''^''^ 



(tI; (OS) 



^h + lo' y ^h + lo' 

(OU) 

Since there were no upper restriction on the value of Zi, we can assume that we have 



taken it such that [li^ili + Iq)T ^l-rj. Then ( pU| ) becomes: 

,lW k( A, Z) - e-'^'^'^^ I ^ I e-^'^' - 'J^^l - _ 1 ^ 

(OV) 



The right-hand side of ( OV I can be made arbitrarily close to by taking 77 small enough, 
so we have proved that 

yXeU EW^^'^'^V^e-''"^"'^. (OW) 

By Levy's theorem on characteristic functions, this is tantamount to saying that Fil) 
converges in law to ^(a^). □ 

The CLT should remain valid for other shapes than a cube, since morally the ran- 
dom field f{Xi) should look like a Gaussian white noise at large scales. Indeed, the 
same proof as above yields a CLT for general shapes, where moreover convergence is 
uniform in the shape considered in some way Let us give a precise statement: 

4.1.7 Definition. An open subset C7 c IR" (not necessarily connected) is said to be if 
its boundary M is a submanifold of IR" (of codimension 1). We define the roughness 
of U, denoted by k(U), as 

K(t/):=SUp|||II(x)|||, (OX) 

xeM 

where 11(0 denotes the shape tensor of M ['22, Chapter 10], which measures the lo- 
cal deviation of M from being flat. Also, the Lebesgue measure of U will be denoted 
by voKt/). 

4.1.8 Theorem. Consider a translation-invariant spin model on a lattice Z" and de- 
fine Eg by p/^ . Assume that Zzez" < oo- Then for any centered square- summable func- 
tion f: X ^U, if(Uk)keN is a sequence of^^ bounded subsets ofU"- with supj^ KiUk) < 00 
and ilk)keN is a sequence of positive numbers tending to infinity, 

^ E fiXo'-^^iah (OY) 



where is the same as in Theorem 4.1.4 
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Proof. Just copy the proof of Theorem |4.1.4[ The only difference hes in proving the 



analoguous of Claim 4.1.5[ which is where one needs the K{Uk) to be bounded. Observe 



that we use the non-asymptotic form of our intermediate bounds to get a result inde- 
pendent of the precise shape of the Uk- 

n 

4.1.9 Remark. Another generalization of the CLT, still based on the idea that the field 
f{Xi) looks like a Gaussian white noise at large scales, is the statement that for (p a 
continuous function with compact support, 

^ ^ (p{Xi ^ DfiXi) M(T^I«n (pixf dx). (oz) 

This can be proved with the same methods as before. 



4.2 Spectral gap for the Glauber dynamics 
4.2.a Introduction 

In this section we are looking at a probabilistic system made of a large number of 
'elementary' random variables (Xj)je/ — / may be seen as lattice and Xj as the state 
of the particle being at site i. As is customary by now, theorems will only be stated in 
the case where / is finite, the infinite case being got by passing to the limit. 

4.2.1 Definition. Denoting by O the states space of X/, let P be a probability measure 
on Q.. The Glauber dynamics [i20l i161 associated to P is the Markov process on H 
having the following law: on each iel there is an alarm clock, all the clocks being 
independent and ringing with law Poisson{l). When a clock rings, the state of spin Xi 
— and only it — is flipped so that the state of Xj immediately after the flip follows the 
lawP(X,|Xj.{,,). 

In formal terms, the Glauber dynamics is the Markov process whose generator 5£ 
on L°°{D) is defined by: 

{^f){xi) = ^E[/(X7)-/(X7)|X7.{,} = X/M^}]. (PA) 

iel 

Let us recall some basic facts on the Glauber dynamics (see ['28', Chapter IV] for 
more details). By construction P is a reversible equilibrium measure for the dynamics, 
so if is self-adjoint on L^(P). Since obviously if 1 = 0, one can also define 5£ on L^(P), 
on which it is self-adjoint too. This leads to the following definition: 

4.2.2 Definition. The energy of f eL^(P) is 

<Sif,n={Lf,n (PB) 

The following immediate identity shows that S is always a nonnegative bilinear 
form: 
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4.2.3 Proposition. 

S{f,n= [ dPfo]^Var(/-|X7.{i}=X7.{i))- (PC) 

4.2.4 Definition. For A > 0, the Glauber dynamics is said to have spectral gap ^ A if, 
for all/"GL2(P), 

Sif,n^XYarif). (PD) 

What makes spectral gap interesting is that its positiveness is equivalent to expo- 
nential convergence to of the semigroup ie~*^)t^o onL^(P), the rate of convergence 
being equal to the width of the spectral gap. As the Glauber dynamics is one of the eas- 
iest ways to simulate the law P for complicated models, the stake of having exponential 
convergence for it is evident. 

Many works have been done on the spectral gap of the Glauber dynamics, see for 
instance Martinelli's St-Flour course [1291 . Several results state that, the less spins 
are correlated, the larger the spectral gap is. Yet the researchers who work on this 
topic generally express the decorrelation between the spins in terms of ;6-mixing (cf 



Definition 0.1.41, while it seems be more natural to look at them in terms of Hilbertian 



decorrelations, since the formula ( |PD[ ) stating the spectral gap problem takes place in 
a Hilbertian frame itself Thus my goal here will be to find a control on the spectral 
gap expressed in terms of p-mixing conditions. Since Hilbertian correlations look to be 
the minimal frame to study the spectral gap for the Glauber dynamics, hopefully the 
bounds yielded by this method will be sharp. 

Another noticeable feature of my approach is that it remains at a quite abstract 
level: no symmetry property of 7 or P need be assumed, all the work essentially con- 
sisting in manipulating relevant quadratic forms. 



4.2.b A lower bound for the spectral gap 



The central theorem of this section is the following: 

4.2.5 Theorem. Take I = {1,...,A'"}. Suppose that for all distinct i,j e / one has 

{Xi :Xj}^ ^ Eij < 1 — we will make the costless assumption that eji = stj. For i e I, 
denote 

li ■■= ^ = ^, (PE) 



and for i < j, denote 



eij : = 



(PF) 



Then the Glauber dynamics has spectral gap at least |||M||r^, where M is the (N x N) 
matrix defined by 



M 



(1 -ei2 

■•• 

••• 



■ • -£iN-l)N 

1 



••• 0^ 

•• •. : 
: ■•. ■•. 







In 



(PG) 
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4.2.6 Remark. The form of the first matrix in the right-hand side of (|PG|) ensures that 
it is invertible. Since moreover all the eij were supposed < 1, all the Eij and the Ij are 
finite; thus, the lower bound |||M||r^ is strictly positive. 

Proof. Let f he a centered square-integrable function on (r2,P). For /' c /, denote 
== cr{Xr). For iel, denote 

/•/':= /•■^7'.B-E[/-|^7.]; (PH) 



define moreover 



ff ■■= f^*''; (PI) 

■■= fj''-''-"^. (pj) 



Then by Proposition 4.2.3 one has 

Sif,n = Y,Yariff), (PK) 

i 

while the usual telescopic argument shows that 

Var(/-) = X^Var(/-<). (PL) 
i 

So to prove the theorem, we have to establish links between the different values 
Varifl ). It will be convenient to introduce the shorthands = Sdifj ). One has the 
following 

4.2.7 Claim. For I' (zl and ij el-^I' with j i, 

Af ^A^'^*^Ve,,-Af. (PM) 

I I '-J J 

Proof. Assume in a first time that I' = 0, and denote fi •= , fj ==/"?, /"^ == Z"*^' and 
3'i ■- ^{i}. Projecting the decomposition "/"^ = f- + (fi ~ //)" on L^(^j), one has fi = 
iff)^' + ifj)^', whence by the Cauchy-Shwarz inequality: 

Sdifi) ^ Sd((/-/)-^0 + Sd((/-,)^0. (PN) 

One has trivially Sd((/"j^)'^') ^ Sd(/";'); on the other hand, fj is X^-measurable, so 
Sd{{fj)^i) ^ Eij Sdi fj). In the end, A becomes 



Sd{fi)^Sd{f') + eijSd{fj), (PO) 



which is (PMI for I' = 0. 



In the case I' ^ 0, the same reasoning can be performed, except that one have to 
work conditionally to ^r- Then, taking fi = ff , fj = ff , ff = f^ &^i = ^r\ti{i}, one 



gets 



Sdifil^r) ^ Sdiff\^r) + EijSdifj\^rl (PP) 
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Now 



Sdifi) : 



Ifsdifi 



\Xr = xrfdPixrl 



(PQ) 



with similar formulas for fj and f'^, since all these functions are centered w.r.t. 



Therefore, integrating ( |PP[ ) and applying Minkowski's inequality yields: 

Sdifi) ^Sdifp + SijSdifj), 



i.e. (PMI 



(PR) 

□ 



For i ^ j, let us denote 



Claim 4.2.7 will be used through the following corollary: 
4.2.8 Claim. For all i < j. 



1 



(PS) 



(PT) 



Proof. We have to bound which here we rather denote A[f to avoid confusion 



with the notation of Claim [4^2?7| Applying Claim [O/T] with 7' = {1, . . . , 6 - 1} {a}, i = a 
and j = b, one has 

^[6-1] ^ ^(l,...,6-l}Ma) ^ ^{l,...,6K{a} ^ ^^^^{l,...,6-l}Ma} ^ ^[6] ^ ^^^^{l,...,6-lK{a}_ ^p^^ 



But applying again Claim 4.2.7 this time with 7' = {1, . . . , 6 - 1} {a}, i = b and j = a, 
one has 



{l,...,6-l}-{a} _ ,< 



[6-1] 



Combining (PUI and (PVI then yields (PTI 



(PV) 

□ 



Now let us show how Claim [4 . 2 . 8| implies the theorem. To avoid heavy formalism, 
I will detail the computations for 7 - {1,2,3,4} (rather denoted by 7 - {a,b,c,d} here 
to avoid confusions with "1" and "2" taken as numbers), hoping that generalizing is 
obvious then. 



First, note that 



Now, by a direct use of Claim 4.2.8 



A< = aM ^ 



1 



CO 



To bound A^, we have to iterate Claim 



4.2.8 



twice: 



I A^^^^ + EA A"" 

be 



1 



(PW) 



(PX) 
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Last, bounding requires iterating Claim 4.2.8 three times: 
1 



a5 = aL"]^ 



ab 



A^^^ + e aA< 



+ Eac^t + ?a6 Afe ^ la A""" + £adA< + EacK + A< 



(l-£^^)(l-el) 



, f . laA^° + IbSab^t + lc(?ac + Eab^bc^^t + (^ad + eacEcd + £ab£bd + £ab£bc£cd)^tl ■ 
( |PX|PY> 



(PZ) 



One can sum up Equations (PWMPZI into the matricial expression 







(1 


tab 


^ac ~^ ^ab^bc 


^ad ~^ ^ac^cd ~^ ^ab^bd ~^ ^ab^bc^cd 









1 


£bc 


£bd + ^bc^cd 


A< 










1 
















1 I 



^^aA„^^ 

V A^ ; 



(QA) 



If we look back at how the square matrix in ( [QA| ) has been constructed, we find that 



square^ 




(0 






£ad 


matrix 


CO 








£bc 


£bd 


in 


= L 











£cd 


















£ab 






1 


-£bc 


-£bd 





1 


-£cd 








1 



(QB) 



so in the end we obtain that 















A< 




i 


[^dl 







(QC) 



where M is given by (|PG|). Then it is immediate that Var(/") = Zi(A^) ^ 



\\\M\fzM7r^\\\M\fsif,f), QED. 



□ 



The bound we have obtained for the spectral gap is not symmetric by permutation 
of the indexes in I. It can however can be bounded by a simpler expression, which is 
nearly as good as the original one in concrete situations: 



4.2.9 Corollary. In Theorem \4.2.5\ M can be replaced by the matrix 

-1 



M' 



provided p{lN - M') < 1. 



( 1 

\-£lN 



-£l2 ••• -£lN 



-£{N-1)N 
-£{N-1)N 1 



(QD) 



Proof. Each entry of M is actually bounded by the corresponding entry of M'. To see 
it, we 'expand' the entries of M, resp. M'. First, notice that 1/(1 - ef.) can be expanded 
into 1 + £ij£ji + EijEjiEijEji -\ — , so that one has the expansions 



E n 

i<jl^-^jkl = ^ 



(QE) 
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and 

k 

Then, using the inversion formula il-A)~^ = L^q^* triangular arrays, one obtains 
that 

k-i 

{io,ii,...,ik) 1=0 
first condition 

where the meaning of "first condition" is given by the following 

4.2.10 Definition. A sequence (io, . . . , iy^) is said to satisfy the first condition if: 

(i) io = i and ik =j; 

(ii) ii 9^ ii+i for all I; 

(iii) ii+i < ii only if Z ^ 1 and = 

(iv) If < ii and I then ii+2 ^ ii. 

One has a similar formula for M': 

Kj= E U^iiiui^ m 

{io,ii,...,ik) 1=0 
second condition 

where 

4.2.11 Definition. A sequence (io,...,ik) is said to satisfy the second condition if it 
satisfies Conditions @ and ^ of Definition |4 . 2 .TO 



Since the second condition is obviously weaker than the first condition, one has 

There is a still weaker but even simpler formula: 
4.2.12 Corollary. Defining 



e: LHI) - LHI) 

ic(-i)i£l '-^ {Y.jjii£ijC(-j)i£j, 



(QI) 



the spectral gap of the Glauber dynamics is at least 

(1-Ille|||)+. (QJ) 
Proof. One has M' -{!- c)"^, so, provided |||A||| < 1, 

oo oo 

IIIM'III = 111(1 -A)-i||| = III ^ A*||| ^ ^ IIIAIII* =(1- IIIAIII)-^ (QK) 

k=0 k=0 



In the case |||A||| ^ 1, (QJ) is trivial. □ 
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4.2. c Avoiding the articial phase transition 



A common situation in which we would like to apply the previous results is when 
/ = Z" and Eij is of the form e{j - i) for some symmetric function e: Z" ^ [0, 1]. Then 



Corollary 4.2.12 tells that the Glauber dynamics has a (strictly) positive spectral gap 



as soon as Xz5^o^(^) < 1- But like in § 3.6.c we are going to prove that that bound is 



somehow 'artificial' and that it can be relaxed into the neater condition "Lzt^o £(2) < 00": 

4.2.13 Theorem. Suppose that 7 = Z" and that for all i,j e Z" one has {Xi :Xj}^, ^ 
eij - i) for some symmetric function e : Z" ^ [0, 1] such that e{z) <1 as soon as z ^ 0. 
Then i/LzeZ" ^(2) < c>o, the spectral gap of the Glauber dynamics is positive. 

Proof. The assumption on allows us to take Z < 00 large enough so that 

zeZZ"^{0} 

We split Z" into a partition of Z" == N sublattices Zi,...,Zn, each lattice Zu being of 
the form ZZ" +Zu for some Zu g Z'^/ZZ". Then we define an auxiliary dynamics: 

4.2.14 Definition. The sublattice Glauber dynamics is the Glauber dynamics for Xz" 
considered as the finite-dimensional vector {Xz^,- ■ . In other words, on each u £ 
{1,...,A'^} there is an independent Poisson{l) alarm clock, and when clock u rings, the 
state of the whole Xz^ is flipped in one shot according to P{Xz^ \Xz"-~^Zu)- 



we 



Now let fs L^iD,). In addition to the notation of the proof of Theorem 4.2.5 
introduce the following definition: 

4.2.15 Definition. For ue{l,.. .,N}, we define 

ff^^:=f-E[f\Xzn^zJ. (QM) 

4.2.16 Remark. The f^^^ are the equivalent of the ff for the sublattice Glauber dynam- 
ics. 



Fixing some 'boundary condition' JCz"^z„ on Z" -^-Zj^, we can apply Corollary 4.2.12 
to the Glauber dynamics for Xz„ under the law P['|-X^z"--Zu - xzn-~.Zu\- After integrat- 
ing, one gets that 

Var(/-(^^,)^(l-|||Cllir' E Var(/-f), (QN) 

ieZu 

where ^ is the operator on L^(ZZ") defined by 

a^)(i)= eiz)gii+z), (QO) 

zeZZ"-{0} 



whose norm is obviously bounded by ZzeZZ"-{0) = = C < 1- Then, summing (QNl for all u: 



N 

E VMf^J ^ (1 (QP) 



'iu) 



Now, let us apply Theorem 4.2.5 to the sublattice Glauber dynamics [Defini- 



tion 4.2.141. It yields that 



Var(/-)^|||M|||2£var(/-(^j), (QQ) 



98 



where M is some (N x N) matrix depending on the {Xz -Xz }* ■ But by Theorem| 3.6.8 



4.2.6 



Combining ( QP I and ( QQ I, 



{Xz^ : XzJ* < 1 for all ut^v, thus |||M||| < oo by Remark 
we finally get that the spectral gap of the Glauber djmamics for X^n is bounded below 
by |||M||r2x(l-O2>0. □ 



4.2.17 Remark. Like Theorem 3.6.8 Theorem 4.2.13 could actually be stated in the 



general case of 'abstract' metric spaces on which some group acts profinitely. 
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Chapter 5 
Concrete examples 



It is now time to see what the results of Chapters [3] and |4] yield for concrete models 
of statistical physics. I will try to give rather different types of examples, so as to 
illustrate the advantages of working with Hilbertian correlations: this frame is indeed 
quite general, as it requires little structure on the models considered. 



In § 5.1 we will look back at Ising's model, seeing how tensorization of Hilbertian 



decorrelations improves the results of § 0.1 and what other results are given by the 



theorems of § |4j We will also consider two kinds of generalizations, namely when the 
range of interactions becomes infinite and when the strength of the interactions is 
random (spin glasses). In the two next sections we will look at models with continu- 



ous states spaces: first a quite general class of linear models [§ 5.2 1, then a family of 



nonlinear models [§ |5.3p . Finally in § 5A we will see how one can consider time as a 



supplementary dimension of the system to get contractivity results for non-reversible 
Markov chains (hypocoercivity) on an infinite system of particles. 

In this chapter, all the probability systems considered will be endowed with their 
natural a-metalgebras, cf. Definition ^. 1.16\ To alleviate notation, I will give no names 



to these a-metalgebras, but will plainly write "{X : F}*" to mean "the subjective decorre- 
lation between X and Y seen from the natural a-metalgebra of the underlying system". 



5.1 Back to Ising's model 
5.1. a Standard Ising's model 

In all this section, we work on the lattice Z" equipped with its natural distance dist; 
accordingly | • | will denote the I ^ norm on . Recall the definition of Ising's model and 



the related notation that we introduced in § |0.1[ and Theorem |0 . 1 . 7| on the existence of 
a completely analytical regime. 

The following theroem states that Ising's model in completely analytical regime is 
p-mixing, i.e. that two distant bunches of spins are little correlated in the sense of 
maximal correlation: 

5.1.1 Theorem. For Ising's model on Z" in the completely analytical regime. 
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(i) There exists some f' > (the same as in Theorem 0.1.7, such that for all disjoint 
I,J Z"-, one has when distil, J) ^ oo that 



{wi : wj} ^ exp [ - iy/' + oil)) distil, J)], (QR) 
where the "o(l)" can be easily computed as an explicit function of distil , J), n, T, 



f' and the C appearing in Theorem 0.1.7 



(ii) There exists some k <1 such that for all disjoint I, J a Z"-, 

{0)1 :a)j} ^k. 



(QS) 



(Hi) Points and remain valid uniformly under any law of the form PI-Imk - <^k\ 
for if c Z" and cok g {+1}^ a 'boundary condition' on K. 



5.1.2 Remark. Let us compare Theorem |5. 1.1 with Theorem 0.1.7 Both theorems state 
decorrelation between distant bunches of spins above temperature T^; the difference 
reUes in using p-mixing rather than j6-mixing to quantify dependence between the 
bunches in Theorem 15. 1.1[ 

Both results give an exponential decay of correlations, with the same exponential 



constant xf/', but Theorem 5.1.1 is more powerful in the sense that the bound (QRI is 



uniform in the size of / and J while (|e]) was not. Moreover, thanks to Point ([n]) we 
get a non-trivial result for any choice of disjoint I and J, which was not the case 



beforehand. Recall that the drawbacks of Theorem 0.1.7 were inherent to /3-mixing, as 
Theorem 10.1.81 shew. 

Both result remain valid under conditioning. However, if one takes a random 
boundary condition — that is, if one works under the la w P[-\a)K ^C] f or som e non- 
singleton C c {±1}^ — , then Point (iiil of Theorem 



5.1.1 



fails (cf. Remark 3.1.4 1, while 



(|e]| is still valid by convexity of the total variation norm. 



5.1.3 Remark. Let us compare Theorem |5 . 1 . 1| with Theorem |0.1.9[ The result of Theo- 
rem [0?L9l can be rewritten: 



{co 



1 :^MxZ"-i} ^ 



(QT) 



Theorem |5 . 1 . 1 1 can be seen as a generalization of that result to the case where / and J 
have arbitrary shapes Moreover, Point (iiil also gives the existence of a conditional 
version, which we did not have before. 

There is however a price to pay for this greater generality, since we had to require 
complete analyticity rather than just weak mixing, which can be really more restrictive 

on page 



in some cases (cf Footnote [*] 



10) 



5.1.4 Remark. Continuing the previous remark, a natural open question is whether 
one can tensorize maximal decorrelation under assumptions of weak mixing type. In 
the case of Ising's model at least, I expect p-mixing to remain true — even for arbitrary 



shapes — as soon as T > Tc, because on the one hand Theorem 0.1.9 proves p-mixing 



f*^Note that in the case / and J are hyperplanes, we shew on page 24 that ( QR ' could be improved into 



-if ^ e 



-Iff X 



(QU) 
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between parallel hyperplanes, while on the other hand p-mixing seems to hold also in 
the 'opposite extreme case' when 7 and J make a check pattern. 

By the way, it is likely that the natural condition should not be weak mixing itself 
but rather something like strong mixing for cubes (often called merely strong mixing^^ 
which means that when a boundary condition is fixed outside a cube of arbitrary edge, 
changing one spin on the boundary has an effect in total variation which decreases 
exponentially with the distance to the spin changed. In fact it has been proved 113 111 
that in dimension 2, weak mixing is equivalent to strong mixing. 



Proof of Theorem 5.1.1 Theorem 5.1.1 will be a direct consequence of the work of 
Chapter [3] as soon as we show that, denoting by * the natural cr-metalgebra of the 
system (i.e. the a-metalgebra generated by the coi), for all distinct i,j e Z", one has 

{wi:coj},^coC'e-l'''^'''^''-'^Ako (QV) 

for some explicit co < oo and ko<l only depending on n and T. Then indeed. Proposi- 
tion |3]6]6] yields 

oo 

{wi-.I^j}^ ^ coC'e-^l^l = coC' £ #{5gZ": |5| = d}e-^'^ 

5eZ" d=dist{I,J) 
\5\>dist(I ,J) 

_ ^-(V/'+o{l))dist{I,J) ^qy^ 

whence Point Moreover, since 

^ coC'e-'^''^'''^''^Uoo, (QX) 

fieZ"-{0} 
\5\^dist{I,J) 



Point ^ follows from Lemma |3.6.8[ and finally ( |iii| ) is a consequence of § |3.4.b| about 
subjective results. 



So, we have to prove ( |QV| ). Let wk £ {±1}^, K Z"-, be some arbitrary boundary 
condition, and denote by Peon the associated law, that is. Peon = P['I^^a: = ^k^', our goal 
is to show that under Peon, for all distinct i,j £ Z", one has {cot : cdj} ^ coC'e"^'^^*^^^^'-^' a 
ko. 



The result is immediate ifieK, resp. j (since then coi, resp. wj, is constant and 
thus independent of everything), so we assume i,j tK. We begin with observing that 
if K is the set N{.i) of all the neighbours of i, equilibrium at i implies that, whatever 
the boundary condition may be: 

Pcon[^^i = -l],Pcon[^^i = +1] ^ (e^"^^ + D"' (QY) 

— the extremal cases being when fc)iv(i) = +1, resp. tt»Ar(j) = -1. Now in the^general case 
K czT"' ^ {i}, LaWcon\-i^i\ is an average of laws of the form LawicoilcoNd) - (Onh)), so that 
(QYI remains valid. Similarly, equilibrium on {i,j} gives that for all a,b € {±1}, 

Peon[wi = a and wj = b]^ {e^"'^ + 2e(4«+2)/r ^ ^^^-i (g^) 



'^^^ Strong mixing stricto sensu is actually the same as complete analyticity, so that mathematicians 
have got used to undermeaning "for cubes" — ^but strong mixing for cubes is strictly weaker than complete 
analyticity! 1.30. § 21. 
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Now, recall that the correlation level between two two-ranged variables can be 
computed by Formula (BDl, where \p ^„-PnP^\ is also l5i X,Y)/2 . Thus the bound 
"{(Di : ^ Coe"^^^^*^''-'^" is a direct consequence of Theorem 



0.1.7 



with 



1/2 



Co = 



= tanh(4re/T) + l. 



(RA) 



(e4«/r + i)-i(i_(eWr + i)-i) 
It remains to prove the bound "{wj : wj} ^ ko". We will use the following corollary 



of (BDl: 



5.1.5 Lemma. With the notation of Remark \l-2.2\ there exists a,b in the respective 
ranges ofX,Y such that 

{X:Y}^l-4pl (RB) 
Proof of Lemma [5. 1.5\ The difference p^-PaP^ gets its sign changed whenever a, 



resp. h, changes, so there are some a and h for which this value is nonpositive; more- 
over, denoting by {a, a'} and {b, b'} the respective ranges of X and Y , p^, -Pa'P^ is also 
nonpositive. Now one has 



PaP 



Pa'P 



b' 



^/PaPa'P^P^' 



VPaPa'P^P^' 



(RC) 



SO that either Pap'^ or Pa'P^' is ^ \/ PaPa'P^P^' ■ Up to changing notation we can as- 
sume that it is PaP^, and then 



{X:Y} 



\Pa-PaP^\ 



PaP^-p^ 



\/PaPa'P^P^' yPaPaJP^ 



^ 1 



P\ 



\/PaPa>P^P^' 



^1-4^. 



(RD) 

□ 



Combining Lemma 5.1.5 with ( QZ ), we then get the desired bound, with 



^o = l-4(eS"^^ + 2e 



,(4ra+2)/r 



+ir^<L 



(RE) 

□ 



Formula ( |QV| ) is also what we need to apply the results of Chapter |4] Indeed, de- 



noting e{z) •■= {Xi : Xj+z}*, it gives that Zzez" ^i^) < oo with eiz) < 1 as soon as ^ 7^ 0, so 
that Theorems 4.1.8| and |4.2.13 yield respectively: 



5.1.6 Theorem. In completely analytical regime, the spins Ising's model satisfies the 



central limit theorem, in the sense that the conclusions of Theorem 4.1.8 hold for them 



5.1.7 Theorem. In completely analytical regime, the Glauber dynamics for Ising's 
model has a (strictly) positive spectral gap, and this remains valid uniformly if one 
fixes a 'boundary condition' on the spins of some K c Z". 

5.1.8 Remark. As I told in Chapter |4} results of these kinds have already been stud- 
ied by other methods (see e.g. O [111 for the CLT and 1 29] for the spectral gap). For 
the standard Ising model in completely analytical regime, which is "very nice", these 
previous works apply well, so the two theorems above are not new. They are inter- 
esting however because of the new method used to prove them, which is quite direct 
and likely to apply to a broader class of models. Such models will be presented in the 
sequel of this chapter. 
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S.l.b Generalizations of Ising's model 



The previous results can be adapted to several kinds of generalizations of Ising's model. 
Let us expose some of them. 

Long-range Ising models 

A physically important case is the long-range Ising models on Z". In these models, the 
states space is unchanged, but the Hamiltonian H becomes 

Hiw) = -^Y.Jij-i)wiCOj, (RF) 

where J : Z" {0} ^ IR is some symmetric function with non-compact support such that 
J(z) 0(|2r'"+«^) for some a > 0. 

Let us state a decorrelation result for this class of models. The frame of the proof 
of the following proposition will work as well for the other generalizations of Ising's 
model. 

5.1.9 Proposition. There exists an temperature Ti < oo such that, provided T ^ Ti; 

(i) Equilibrium for the long-range Ising model is unique; 

(ii) Uniformly in i,j, {oJi-.Wj}^:^'' = °° Odj - il"^"^""'); 

(Hi) There exists some ko <1 such that for all i ^ j, {toj : coj}^, ^ko. 

Proof. The principle of the proof consists in coupling two Glauber dynamics with dif- 
ferent initial conditions. Recall that the Glauber dynamics is defined as follows: each 
spin has an independent clock ringing with rate 1, and when the clock of a spin rings, 
this spin is flipped so that its final state is drawn according to its equilibrium measure 
conditionnally to the state of all other spins. Namely, if the clock of spin i rings at 
time t, denoting as usual fi = T~^, 

P[coiit+)^+l]^ , (RG) 

2coshil3Zj^iJU-i)(^jit)) 

and P[wiit+) = -l] = l-'P[a)iit+) = +l]. 

To couple the Glauber dynamics, we will assume that, rather than just "ringing" 
at time t, the clock of i is a Poisson process on R+ x (0, 1), points of which are denoted 
by (t,y). Then, if at time t the clock of spin i has a point it,y), spin i flips to +1 if 
y < P[a)iit+) = +1], resp. to -1 if y 3^ P[wiit+) = +1]. 

Now, consider two Glauber dynamics a)~ and w"*" having the same Poisson process. 



but starting with different initial conditions. It will be convenient to assume that 
oj~{t = 0) ^ lo'^it = 0) almost-surely: then, as we will see, for the coupled dynamics one 
has (a.s.) co~{t) ^ oj'^it) \/t. At time t, denote by @it) the set of points where w~ and w"*" 
differ: 

eit) = {ieZ'': (o)Jit),ajtit)) = i-l, + l)}. (RH) 



'^*^In the cases where interactions can be antiferromagnetic (J < 0), monotonicity does not stand any 
more; the proof however remains valid with a heavier formalism, replacing ">" by 'V" and putting 
absolute values at the right places. 
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When the clock at spin i rings at time t, three cases have to be distinguished: 

1. lfy<exip{BY.ij^iJU~i)^l(t))/2cosh{BY.ij^iJ(j-i)^~iit)), then both and 017 

J 7- J J 7- J II 

flip into state +1; 

2. If y > ex^ipZj^iJij - i)o}]it)) 1 2cosh.{pZj^iJij - iWjit)), then both w+ and w7 
flip into state -1; 

^ T„ exp(/3I,y,J0--0wT(i)) exp(;3Ij^i JO-Ow+CO) . „. . ^ ^ ^ 

3. It 7^ — -rm^ r/ ■ ■^ < J < 7, — TTT^ ?7^-^T+77u, then cj uvos mto state +1 

while (1)'^ flips into state -1. 
Denoting 

zeZ"-{0} 

which is always finite by the assumption on J, the probability of each of the two first 
cases is bounded below by e~'^'^/2cosh(/3^). The probability of the third case is 

sinh(2j6I^ee(iK{i)^(j - i)) ,^ 

(RJ) 



2cosH^Zj^iJij-i)(^~jit))cosh{PZj^iJij-i)(^]it)y 

which is bounded above by P'Lj£Q{t)'^{i}<J(j ~ i) thanks to the following computational 
5.1.10 Lemma. For a^b two real numbers, 

sinh(6 -a) ^ (6 -a)coshacosh6. (RK) 

Proof. Making the change of variables x = (a + b)/2, t = (b - a)/2, we have to prove that 
for X €U,t ^ 0, one has: 

sinh(2^) ^ 2^cosh(x - ^)cosh(x + 1). (RL) 



If we consider the right-hand side of (|RL[) as a function of x, it is symmetric (since cosh 
is symmetric) and its logarithm is convex (since logo cosh is convex, its derivative being 
the increasing function tanh), so its minimum is attained for x = 0; thus it suffices to 



prove (RL I in that case, i.e. to prove that sinh(2^) ^ 2^ cosh^ t for all t^O. But sinh(2^) = 
2 sinh t cosh ^ , so we can simplify both sides by 2 cosh t, and then it suffices to prove that 
sinh t cosh t, which is true since tanh t for all t^O. □ 



Thanks to these estimates, we can define a process Markovian @*it) on ^(Z"^) such 
that almost-surely Q*(t) 3 @(t) \/t. This process has the following law: 

5.1.11 Definition. The law of 0* is defined thanks to independent Poissonian clocks 
indexed by (Z")^. For i ^ j the clock has rate pJij - i), while the clock (i,i) has 
rate e-^^/cosh(;6/). At ^ = one has 0*(O) = 0(0). If at time t the clock (i,j) rings, 
with i, then: 

• Either i e @*it-) and then 0* changes so that 0*(^+) = 0*(^-) u {jp 

• Or i ^ 0*(^-) and then 0* does not change. 

On the other hand, if at time t the clock (i,i) rings, then 0* changes so that 0*(^+) = 
Q*it-)^{i}. 

f^^Of course, if j e &*{t-) then 6* does actually not change. 
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Let A : = )6/-(e-^^/cosh(^^)). If we take E[#0(i = 0)] < oc 



[1] 



it is immediate that 

#©*it)/e'^* is a supermartingale. So, provided T is large enough so that A < 0, i.e. 

PJ< , (RM) 

cosh(;6/)' 

the two processes w~(t) and co'^it) tend to be equal when ^ ^ oo; in particular they have 
the same equilibrium. That proves Point ^ of the Lemma, since any initial condition 
stands between the 'extreme' conditions oj~it = 0) = -1 and co'^it = 0) = +1. 

Observe that the previous reasoning remains entirely valid if one reasons condi- 
tionally to some boundary condition of the form "wk - cok", with the same condition 
on T. 

Now we are turning to the correlation between two distant spins. Let i g Z" and 



let cok be some boundary condition on some if c Z" {i}. Suppose T satisfies ( |RM[ ); 
I want to compare the Glauber d5mamics corresponding to the boundary condition 
"ojK\t){i] - — where stands for the function on K\±){i} which is 

equal to w on if and to +1 at i — with the Glauber dynamics corresponding to the 
boundary condition - {(j)k,(-^)^''^)"- In this frame, one defines the process 0* 

as previously, except that one imposes that @*it)r\K = and i £ ©*it) for all t. This 
time, it is the equilibrium behaviour of 0* which interests us. Denote by Peq the 
equilibrium law of 0*; for / G Z" ^K, denote dij) ■■= PeqU ^ ©*]• Then d satisfies the 
following discrete subelliptic equation with Dirichlet boundary conditions: 

"^j^K^u) ^^eij)^ pZi'ez'^^KJii' -jm'y, 

V^Gif 0(^) = O; 0(i) = l. 



Define the convolution kernel a on Z" by 

f a(0)=l; 



VzT^O a{z) = Trff-pJiz), 



SO that ( |RN| ) writes in the bulk: 

a*e^O. (RP) 



Writing a -■■ do - a, Condition (RM) ensures that ||a||/i < 1. Since l^iZ"-) is a Banach 
algebra for the convolution operator *, with neutral element Sq, it follows that a is 
invertible with inverse 

a~* = 6o + d + a * a + a * a * a -\ — . (RQ) 

Since a^O, a~* is nonnegative ever5rwhere with a~*(0) > 0. Therefore the function i^ := 
(a~*(0)) ""^(Jj * a~* satisfies: 

yj€K^{i} ia*F)(j) = 0; 
MkeK Fik)^0; Fii)=l. 

'^'"^The general case where 0* can be infinite can be got from the finite case by passing to the hmit, 
despite some technicalities of little interest. 
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Comparing ( |RN| ) with (RRI, since ( |RN[ ) is subelliptic, we can apply a maximum principle 



to il^"^ which yields that O^F everywhere. But Jiz) = Oi\z\ so by Lemma 5.5.7 



in appendix, Fij) = Oi\j -i\ and therefore 

P[coj = +l\I^K = ^K,m = 1] -PK = - = = -1] = 0(|j - ir^"+"^ (RS) 

uniformly in i,j,K,a)K- 



The end of the proof, namely deducing Point (ju]) from ( |RS| ) and proving Point is 
then performed in the same way as to establish ( [QV] ) in the proof of Theorem 5.1.1 □ 



Thanks to Proposition |5 . 1.91 we can apply the results of Chapters [3] and |4j One gets 
the following 

5.1.12 Theorem. For the long-range Ising model on Z"^ at T ^ Ti, 

(i) For all disjoint I,J Z", uniformly in I, J, one has an estimate 

&i : wj} ^ 0{distil,jy"), (RT) 

where the O(-) can be turned into an explicit constant only depending on J and T. 
Moreover, there exists some k < 1 (still explicit and only depending on ^ and T) 
such that for all disjoint I,J Z", 

{c^i:wj}^k. (RU) 

(ii) The spins satisfies the central limit theorem, in the sense that the conclusions of 



Theorem 4.1.8 hold for them. 



(Hi) The Glauber dynamics has a positive spectral gap. _ 
(iv) Points and (iii* remain valid uniformly under any law of the form PI-Imk - (^k\ 
for K<^T^ and cok g {+1}^ a 'boundary condition' on K. 

Proof. The proof is the same as the work done in the previous subsection. The only 



difference is to prove ( |RT| >, which follows from the following computation: denoting 
D ■■= distil, J), one has that, when Z) ^ oo. 



E 

zeZ" 
\z\»D 



oo 



L 

d=D 



#{2£Z": \z\ = d} 



d=D"' 



d=D I" 



□ 



Spin glasses 

Spin glasses are another generalization of Ising's model. In these models, the inter- 
action constants are not invariant by translation any longer. The Hamiltonian writes 

H&) = - ^ E JiiJ)(^i(^j (RW) 

(with Jij, i) = J(i,j)), where the J(i,j) themselves are random. We make the following 
assumptions on the interaction constants: 

'^"^The maximum principle is generally stated in a PDE context, see for instance II19I § 3.1], but it 
works exactly the same for discrete equations. 
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5.1.13 Assumption. For distinct unordered pairs {i,j}, all the J{i,j) are independent. 



Moreover, J{i,j) is distributed according to some law only depending on (J 



[*] 



We will assume that all the have bounded support, and we denote by Joo(z) the 
smallest number such that Pj^[| J| ^ Joo(z)] = 1. 

5.1.14 Remark. Here the J{i,j) can be negative, which corresponds to antiferromag- 
netic interactions. 

In spin glass models, there are two levels of randomness: first to fix the J{i,j), 
next to take co according to the Gibbs measure associated to H. When both levels of 
randomness are taken into consideration, one speaks of annealed law. Here I am only 
interested in the quenched laws, which deal with the second level of randomness for 
fixed J{i,j). I will write sentences beginning with "for almost-all quenched systems", 
which mean that what follows is valid for almost-all Gibbs measures when the J(i,j) 
are taken randomly according to Assumption ^l.l3\ 



The machinery exposed above still works for spin glass models. We obtain the 

5.1.15 Theorem. Suppose that when 1^1 ^ oo, Jooiz) decreases at least as fast as 
Q^|2|-(n+a)-) yjj^ somc a > 0. Then there is a T\<oo such that, for the spin glass model 
on Z"' at T ^ T\, for almost-all quenched systems, 

(i) If Jooiz) - Odzl"^""*""^), then for all disjoint I,J c Z", uniformly in I, J, one has an 
estimate 

{wi : wj} ^ Oidistil, J)""). (RX) 



If moreover Jooiz) has exponential decay (see Definition 5.5.4 in the appendix). 



then the right-hand side of {M} can even be replaced by "d{dist{I,J))" for some 
function 6{') with exponential decay. 

In both cases, there exists some k <1 such that for all disjoint I,J Z", 

{coi:wj}^k. (RY) 



(ii) Points mf-mvi of Theorem 5.1.12 hold. 



Synthetic vocabulary 

For all the models considered in this section, the techniques used and the results stated 
walked along the same lines. First, one establishes a bound [Xt : X^}* ^ e(j -i) /\kQ for 
all i ^ j, for some sufficiently rapidly decreasing function e: Z"' ^ [0, 1] and some ko< 
1. Then, one applies the results of Chapters|3]and|4| which yield maximal decorrelation 
for distant bunches of spins (which is sometimes called (interlaced) p* -mixing) with 
uniformly non-full correlation between any two disjoint bunches of spins (which is 
sometimes denoted "p*(l) < 1"), central limit theorem, and spectral gap for the Glauber 
dynamics. 

Since this method will be used again in the following sections, it will be convenient 
to introduce some synthetic vocabulary: 

^*^Observe that one has necessarily ^' - Pj^ for all z; in particular the function Joo : ^ {0} ^ R+ 
shall always be symmetric. 



108 



5.1.16 Definition. If a spin model (spins can have arbitrary range) X on Z" satisfies 
some bound "{Xj :Xj}^ ^ e{j - i) Ako" for all distinct i,j g Z", with Zzez^-wi^^^) < °o 
and ko <l,we say that this model is well-p-mixing. According to our results, for such 
a model one has p* -mixing with p*(l) < 1, CLT and spectral gap. 

Moreover, 

(i) If e{z) = OCIzr^""*""^) when l^l ^ oo, then we say that the model is a-polynomially 
p-mixing. According to our results, in this case p* -mixing is polynomial with rate 
a, i.e. Formula ( |RT| ) holds. 



(ii) If e(z) has exponential decay (cf Definition 5.5.41, then we say that the model is 



exponentially p-mixing. According to our results, in this case p* -mixing has an 



exponential speed of decay (but not with the same rate as £(•), cf Remark |5. 5. 6[ ), 
i.e. a formula similar to ( [QR] ) holds. 



5.2 Quadratic models 



In this suh subsection, an arbitrary norm \- \ onJ."- is fixed. 

5.2.1 Definition. In our quadratic model, the states space is C = U^" for some neN* . 
For G O, i G Z", the real number coi will be called the polarization of particle i. 
Each particle i is submitted to two types of forces: 

• A pinning force, preventing the particle from having a too large polarization, 
which derives from the quadratic potential (W?/2; 

• Interaction forces: each particle j ^ i exerts a force on i which tends to make 
the polarizations of particles i and j equal; this force derives from a quadratic 
potential jj-iicjj - a>j)^/2. 

In other words, the Hamiltonian of the system is formally defined by 

^ ieZ" ^ ijij 

where the jz, for 2 g Z" {0}, are nonnegative numbers which we impose to satisfy the 
symmetry condition = j-^ for all z. Moreover we impose the that the sum of the jz 
is convergent, and we denote 

r:= rz<oo- (SA) 



The Hamiltonian i7 is a quadratic function of to, so at fixed parameter ^ the 
(infinite-dimensional) random vector cj will be Gaussian (and centered). Let us com- 
pute its covariance: the probability density of oj w.r.t. the 'Lebesgue measure' on VL is 
formally defined by 

oc exp {\a7{PQ)w), (SB) 



where Q is the (infinite-dimensional) symmetric matrix defined by 

Qij--=-Yj-i for i^j; ^g^^ 
Qii--1 + T on the diagonal. 
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thus the covariance matrix of w is ifiQ) . So we have to compute Q , the inverse 



matrix of Q. Since Q is a Toeplitz matrix (with re-dimensional indexes'^, — if 
it exists — will be of the same form. Now, knowing that it is a Toeplitz matrix, Q is 
described by the function aq: Z" ^ IR such that for all i,j, Qij = aqij - i). With this 
notation, ( [SC] ) rewrites: 

V2GZ" aQ{z) = lz = Q{l + T)-lz^orz. (SD) 

When coded by functions like aq, the multiplication of Toeplitz matrices becomes the 
convolution product: 

VM, A'" Toeplitz auN = o-m* o,n- (SE) 

So, will be the Toeplitz matrix whose aq i is the inverse of aq for the convolution 
product. Thanks to Condition ( [SA] ), such an inverse always exists: indeed we can write 
aq = {l + T){6q - aq), where aq is a nonnegative function with \\aq ||;i = F (1 + F) < 1, 
so that aq is invertible with 

ttQ* = (1 + F)"-^(5o + dq+aq *aq+aq *dq *aq + •••). (SF) 

In the end, at parameter p>0 the covariance matrix of w has entries: 

an-iij-i) 

CoviaJi,a)j) = . (SG) 

5.2.2 Remark. All the entries of Cov(a») are nonnegative, which reflects the fact that 
all the interaction forces are attractive. 

5.2.3 Remark. Since Cov(a)) depends on p only through a constant factor, the behaviour 
of the system is exactly the same, up to a multiplicative constant, for all P>0. Hence 
the study of correlations will not depend on p. 

In the sequel, we fix arbitrarily P = 1 and we denote P for P/3=i. 



Since the model is Gaussian, by ( |SG| ) and Theorem 1 1 . 2 . 6 1 one has for all i ^ j: 

an-iij -i) 

{coi:(^j]^— 77^. (SH) 

aq-iiO) 

Now we have the following claim, with an immediate key corollary: 

5.2.4 Claim. For all i ^ j, for all K UJ), 

{a)i:a)j}-;;;j^^{coi:tDj}. (SI) 

5.2.5 Corollary. Denoting by * the natural a-metalgebra of the system, for all i ^ j, 

aq-iij -i) 

{(Di:wj}*={(Di\Wj} = — — . (SJ) 

aQ-i(O) 



Proof. The proof of Claim 5.2.4 relies on the following claims: 
^^^Recall that saying that matrix Q is Toeplitz means that its entries Qtj only depend on 0' - i). 
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5.2.6 Claim. Up to an additive constant, Lawit)^", \a)K^ = mk) is the same for all wk £ 

i.e. there exists a vector-valued function cjk offsetiiOK) such that the law ofojjn 
under V\_'\oiK - is the same as the law o/wz" + offsetiiOK) under 'P\_'\a)K = 0]. 

5.2.7 Lemma. For {X,Y) a two-dimensional centered Gaussian vector with X and Y 
non-degenerate, 



{X:Y} 



/E[X2] 
E[y2] 



E[yix = i]|. 



(SK) 



5.2.8 Claim. For K c Z", the function off^set^ defined in Claim ^.2.6 is nondecreasing, 
in the sense that each of the entries of offsetiSx) is a nondecreasing function of each cjk 
for keK. 

5.2.9 Claim. For i£Z'',K^T'^ {i}: 



offsetil^'Ko^) ^ offsetil^'^), 



(SL) 



where (1^'', 0^) stands for the function on K\±) [i] which is equal to 1 at i and to on K, 
resp. l'^' stands for the function on {i} mapping i to 1. 

Admit temporarily the claims. Let i,j be distinct points of Z", let if c Z" ^{i,j} 
and let_WK_G our goal is to compute : coj} under PI-Icok - iox]. First, by 
we can suppose that cok = 0. Now under P[-\a)K = 0], {wi,a)j) is still Gaus- 



Claim 



5.2.6 



sian by the properties of Gaussian vectors, and it is centered by symmetry, therefore 
by Lemma 5.2.7 , {a>j : Wj} is equal to 



E[wj|wjf = and a>j = l]| 



(o/fse^(lf'',0^)-j) (SM) 



—one has indeed offsetil^'\0^) ■ j ^ 0, since by Claim |5.2.8[ offset{l^\Q^) 
offsetiQ^^"^^) = 0. 



Now, taking K - <^ m. ( [SM| ), we find that under the law P: 

{(Di : (Dj) 



E[w2] 



\ E[..5] 



{offseta^"^)-]). 



(SN) 



which is ^ ^E[w2]-E[a>2](o//sei(l<^',0^)-7) by Claim 

j2i . Trr...2 

getting the desired result 



5.2.9 



But up to switching the 

roles of i and j, we can assume that E[a)^] E[w^] ^ E[a»^|a»jf = 0] ELw^lw^f = 0], thus 



(offsetil^'K0'')-j) = {aji:ajj}- 



^E[coj\c!)K^O] 



MR- 



(SO) 



□ 



Proof of the claims. 

It is a well-known property of Gaussian vectors, which here is stated 



Claim 5.2.6 



in an infinite-dimensional setting. 



Ill 



Claim 5.2.7 - Since iX,Y) is centered Gaussian, y^^^^ is the orthogonal projection 



of the variable Y on IRX, so E[y|Z = x] oc x. Thus one has: 



/ 



E[XY]= xE[Y\X = x]dP[X = x]= E[Y\X = l]dP[X = x] = E[Y\X = 1]E[X% 



But for such a Gaussian vector, Theorem 1 1 . 2 . 6| gives that 

|Cov(X,y)| mxY]\ 



[X:Y}. 



Sd(x)Sd(3/) v/E[X2]E[r2]' 



(SP) 



(SQ) 



which combined with (SPI gives (SKI. 



Claim \5.2.8\ - First, notice that Eicoz'^l^^K = 0] = 0, so that 

offseticox) = E[(jo\coK = <^k]- 



(SR) 



Now, allowing temporarily /3 to vary again, by the properties of Gaussian vectors, the 
vector-valued variable TSt^icjzA^K = is Gaussian with constant expectation and 
covariance matrix proportional to fi. Therefore, the common expectation of all these 
laws is equal to the constant value of toz" for j6 = 0, which is the w minimising H under 
the constraint "ojk - cok"'- 



offset{(x)K) - argminifCw). 



(SS) 



Since it minimizes energy, the state offset&K) is at equilibrium outside K. In other 
words, it is the solution of the following subelliptic system: 



(ST) 



(That system is clearly subelliptic because the pinning and interaction forces all are 



attractive). By the maximum principle, the solution of ( |ST| ) is an increasing function of 
the boundary condition, which was our claim. 



Claim 



5.2.9 

TT 



Denote = offsetil^% Since offsetiO^'^) = 0^", by Claim 



has ^ 0^ '. Since obviously oj^ = 1, one has even wi,„ ^ (1^",0^ '"^'0. In particular. 



5.2.8 



one 



^'{i]it)K ^ (l'''>0^); therefore, using again Claim 5.2.8 
Now, we defined a>L as offsetil^'^^), so by Formula (ST I it satisfies 



(SU) 



(SV) 



for all i' G Z""^{i}, hence a fortiori for all i' g Z"' ~^({i}\±>K). Since moreover obviously 
coincides with Wj^.ji^^ on {i}\±)K, this implies, by Formula (ST I again, that 



offsetiwl^^j^) = = offsetil^'^). 



So, (SUI becomes "offsetd^'^) ^ offset{l^\O^T , what we wanted. 



(SW) 

□ 



112 



Thanks to Corollary 5.2.5 our tensorization theorems give decorrelation results for 
the quadratic model: 



5.2.10 Theorem. Provided Condition (SAJ holds: 



(i) The quadratic model is well- p-mixing, cf. Definition \5.1.16\ IfT<l, one can be 
more specific about the property "p*(l) < 1": for all disjoint I, J a 'P, {a*/ : wj} ^ F. 

(ii) Moreover, if there is polynomial decay of interactions jz - 0{ll\z\"'^°^), then the 
model is a-polynomially p-mixing, and ifjz has exponential decay, then the model 
is exponentially p-mixing (but not with the same rate as jz in general). 



Proof To prove Point we have to show that 

^ aq iiz) 

— recall that we assumed F < oo. We write: 

an-liz) Y.zeZ'^CLn-liz) 
y _^ = 1 1 

0eZ«MO}«Q-i(O) aQ-i(O) 



(SX) 



(SY) 



There, Lzez^c^Q 1(2) is equal to 1: indeed, aq 1 is the convolution inverse of aq, so by 
Fubini's theorem: 

^ aq-i{z) = { aqiz))'^, (SZ) 

zeZ" zeZ" 

where 

Yaqiz)^ Y (-r(2)) + (l + r)=-F + l + F = l. (TA) 

zeZ" zeZ"--{0} 



Now, by (SFl, i(0) is obviously bounded below by (1 + F) ^, so in the end: 

1 



^ aq-i(z) 
zeZ"MO}«Q-i(0)^(l + rH 



1 = F. 



(TB) 



To prove Point (|u]), we have to show that polynomial decay of jz implies polyno- 
mial decay of aq i with the same exponent, resp. that exponential decay of jz implies 
exponential decay of aq 1. This is achieved resp. by Lemmas 5.5.5 and 5.5.7 in the 
appendix. □ 



5.3 Nonlinear lattice of particles 

In this section we will consider a model with continuous spins, but where interactions 
are nonlinear, so that we cannot use the properties of Gaussian variables. One has a 
lattice of particles indexed by Z" (equipped with its l^ graph structure), each particle 
i being described by its "polarization" oji g U. Each particle is submitted to a pinning 
force deriving from a potential V , and to interaction forces with its neighbours, the 
interactions deriving from a potential W. In other words, the Hamiltonian is formally 

H(w)=^y(wi)+^X^K-^^)- (TO 

ieZ ^ i~j 

We make the following assumptions: 
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5.3.1 Assumption. Both V and W are convex; moreover V is uniformly strictly convex 
and the Hessian of W is bounded, i.e. there exist constants i;* > and if* < oo such that 
for all X e K, i;, ^ V"{x) and W"{x) ^w*. 

We are interested in the equilibrium state of the system at some inverse tempera- 
ture < ^ < oo. (In the sequel we suppose that p is fixed). 

Let i 9^ j eZ, K cZ^ {i,j} and ojk £ we want to study the law of under 
the law P['|wii: = wj^]. Then, the probability distribution of the system is formally 
described by 

dP{wi,(Dj,WK^-^{ij))<x exp{- pH{a)i,Wj,0JKc^{ij],a)K)). (TD) 

Our assumptions ensure that the function Hi-,-,-,a)K) is uniformly convex, so that the 
equilibrium exists and is unique. 

For the sequel, we need to recall the definition of the Woo Wasserstein distance: 

5.3.2 Definition (see also |TT!l). For fj.i,fj.2 two measures on some metric space iX,d), 
"Woo(jUi,/i2) ^ £" means that there exists a probability measure 7 on such that the 
two respective marginals of 7 are jUi and IJ.2 and such that d(xi,X2) ^ e 7-a.s.. This 
defines a (possibly infinite) distance on the probability measures on E. 

The fundamental lemma of this subsection is the following 

5.3.3 Claim. For ojj g R, denote by nioij) the law of coi under P['\ci)K\t){j] = i(SK,(^j)l 

There exists a function e : Z ^ [0, 1] with eid) < 1 as soon as d>0 and aid) ^ Ce ^ 
for some 1// > and C <oo, such that 

y(d],(d^j£U Woo(M^^]),)U(^3y))^e(U-j|)|w|-w)|. (TE) 

Proof. The proof relies on the 'explicit' construction of a coupling measure 7 be- 
tween /i(t()j) and jJiico^). To do that, we will construct 'P\_-\a)K\t)[j} = (^K,<^j)] thanks 
to a reversible Fokker-Planck dynamics, and then couple the dynamics for o)^ and w^. 

We define the Fokker-Planck dynamics thanks to independent white noises 
(dB^)t£u for i' eZ"-'-iK\±> {j}). The motion of point i' is defined by: 

dcoi^ = -piV'iwiO + ^ WVf - o)i")) + V2dBi', (TF) 

with the boundary condition coK\t){j} = WifaO) times. Coupling then consists in 

taking the same noise for the two processes. The initial condition is not very important 
since it is asymptotically forgotten, so we will suppose that the two systems have been 
coupled for an infinite time, so that at any time both systems follow their equilibrium 
law. We denote by w^it) the system correponding to the boundary condition "(Ji)K\t>{j] - 
ia)K,toj)", resp. by lo^it) the system correponding to the other boundary condition. We 

denote Aj'(^) := co'^,it)-(i)]fit). Then when the dynamics are coupled, A evolves according 
to the following equation: 

diM = -p[{V'{wf,) - V'ico],)) + ^ (W'iwl - co^,) - W'io)], - aj]„))l (TG) 
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Obviously the right-hand side is not a deterministic function of A(i), but it can nonethe- 
less be written as 



-P[vii', t)AAt) + wii', i", t)iAAt) - AAt))l 

for some v(i',t) and wii',i",t) satisfying 

vii',t) ^ i;* and 
0^w(i',i",t) ^ w* 



by Assumption |5 .3. 1| Moreover, one has the boundary conditions: 

Aif = 0; 



yt 



(i)^.-a)^-. 

J J 



(TH) 



(TI) 
(TJ) 



(TK) 



So, A is the solution of some discrete 'damped heat equation', whose coefficients can 



vary along time though having to satisfy bounds (|TTj) and (jTJj). Such an equation has 
no stationary solution stricto sensu; however there exists some A"*" such that 



A(^) ^ A+ 



yt'^t Aa')^A+; 



(TL) 



namely, this A"*" is defined as the solution of the following system of equations: A^ = 0, 
A+ = a>2 - m], and for all i'€K\±) {j}. 



= -u.A++ ^ 1^+ ^^+u;*(A+ -A+). 



(TM) 



One has similarly that 

Consequently I claim that for all t one has 

O^A(^)^A+: 



(TN) 



(TO) 



indeed if the initial condition of the system satisfies (TO I, then that property remains 



valid for all subsequent times; now, as I told, initial conditions are asymptomatically 



forgotten, so in fact ( ffoj ) is always satisfied. 

One has the following control on A"*" : 
5.3.4 Claim. There exists a function e : Z ^ [0, 1] with eid) < 1 as soon as d > and 
eid) ^ Ce ^ for some y/>0 and C <oo, such that 



VieZ" A+^edj-il). 
Moreover, the function e does not depend on K nor on j. 



(TP) 



Combining ( TO I with Claim 5.3.4 ends the proof of Claim 5.3.3 



□ 
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Proof of Claim [5. 3. 4\ First, notice that Equation dTMb satisfies a maximum principle, 



so we know in advance that A"*" is uniquely defined with ^ A"*" ^ 1 everywhere. 



For i' ~ i" , denote Wi'ii") ■= 1^+ ^ ^+w* . Then (TMl can be rewritten into: 

i" " i' I 

wAi") 



X A,-" + 



i^iiV*+Zi"~i'Wiii") ' ' v*+Zi"~i'wAi") 



xO. 



(TQ) 



Now I define the following Markov chain on Z" \±){d}, d denoting a cemetery point: 
5.3.5 Definition. 

• If at some time the particle is on some point i' of Z"' ~^iK\±) {j}), at next time it 
jumps onto the neighbour i" of i' with probability Wi'(i")/{v* Wi'ii")), and 
it jumps onto d with probability / (u* + ^vH"))', 

• If the particle is somewhere m.K\±} {d,j} at some time, then it does not move any 
more. 

Call {Xt)teN such a Markov chain and denote by if its generator. It is clear that 
with probability one, Xt eventually remains at some point of K\ii{d,i}. Extend A"*" 



to Z" i+J {5} by setting Aj = 0; then, (TQI merely means that A"*" is ^-harmonic, and it 
follows that 



A+ = E[AXoo)|Xo = i]. 
Thus, to bound above A^ I write that 



(TR) 



E[AXoo)|Xo = i] 



t-1 

ii,...,it-i€Ki±){j] 



t-1 



i=iQ u=0 ^i"~iu'^i 

ii,...,it-i^j 



Wijiu+i) I 2dw* \* 



-1) I zaw Y 
ii^)^2dw* +vj 



I 2dw \ 



t-1 



L n 



WiMu+i) 



-i) ( 2dw* \\J-i\ ^^^^ 

^ . (TS) 

y2dw*+vJ 



^1 



□ 



From Claim 5.3.3 we take the following 



5.3.6 Corollary. For a Lipschitzian function f : IR ^ K, denote by \\f\\Lip the optimal 
Lipschitz constant for f. On L^icoi), define the (possibly infinite) norm W'Wup such that 
\\fi(^i)\\Lip - \\f\\Lip^^\' denote by Lipiwi) the corresponding Banach space. 



Then under the law Pb\a)K - (^kX the map Trai^ai^ defined by ^AJ^ is eilj - i\)- 
contracting when seen as an application from Lipicji) into Lipiwj). 



'^*^This definition can be ambiguous if the support of is not the whole ffS; in this case, just add an 
infimum in the definition. 
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Consequently, the map nojiWjWi '■ Lipiwi) ^ Lipid) {) is edj - -contracting. But the 
canonical embedding Lipiwi) L^ioJi) is continuous as our hypotheses ensure that 
Law{(i)i) is uniformly log-concave, therefore for all f eLipicoi) one has 



(TT) 



Since n 



WiCJjCJi 



is self-adjoint in L^icot) and Lipioji) is a dense subset of L^((Wj), it fol- 



lows by Lemma 



0.3.1 



that TTaijOi is £:(|7 - i|) -Contracting also in L This, by 



Remark |1.1.10[ is equivalent to saying that 

{a)i:a)j}~^ ^ e(\j-i\). 



(TU) 



(TUl is what we need to apply Lemma 3.6.8; in the end, we get the 



5.3.7 Theorem. The model ( [T^ is exponentially p-mixing. 



5.4 A hypocoercive system of interacting particles 

For the time being we have only been dealing with spatial decorrelations. Yet I have 
had the idea that the ability of Hilbertian decorrelations to get tensorized for infinite 
sets could be well adapted to the study of temporal relaxation of an infinite stochastic 
system: one can consider indeed time as an extra dimension for the particle system. 



which leads to a situation analogous to the parallel hyperplanes of § 0.1. c In the 



reversible case, we saw that spectral techniques make it possible to get L results 



from results, cf Theorem 0.1.9 Here I will show how Hilbertian decorrelations can 



be used for a non-reversible particle stochastic system. 

The system which we will study here as an example is governed by a kinetic Fokker- 
Planck equation. This equation, which arises naturally in physics, corresponds to a 
Hamiltonian evolution perturbed by some noise acting on speeds. The study of such 
systems is made complicated by the fact that diffusion is only performed along certain 
directions of the states space, so that the non-reversibility of the evolution is essen- 
tial to ensure convergence to equilibrium. In [42], Villani proves convergence for 
such systems in situations where the state of the system lives in a finite-dimensional 
manifold. Here we will use tensorization of Hilbertian decorrelations in a fundamen- 
tal way to get a result valid in an infinite-dimensional setting. Moreover, we will get 
non -trivial bounds for arbitrary small times, which is a new feature compared to [42|. 



5.4.1 Definition. For real parameters m,a»,c, we consider a system of par- 

ticles i indexed by Z, each particle being described by its momentum pi €U and its 
position Qi^U. We consider the Hamiltonian 

2 2 2 

Hip,q) = m 2^ — + mw 2^ — + mc 2^ . (TV) 

ieZ ^ ieZ ^ ieZ ^ 

^^^m is the mass of each particle, w is the frequency corresponding to the pinning potential, c is more 
or less the speed of sound, expressed in inter-atomic distances by unit of time, T is the temperature and 
A is the relaxation constant of the friction. Physical homogeneity of these constants are resp. [M], [T~^], 
[T-1],[ML2t-2],[T-1]. 
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Then the system ip(u),q(u)) evolves according to the Hamiltonian H, plus a white 
noise independent on each pi, plus a friction force Ft = -A.pi on each i which dissipates 
the energy brought by the white noise, friction being adjusted to the noise so that their 
association constitutes a (volumic) thermal bath at temperature T. One computes that 
this means that the quadratic variation on is given by d[pi] = 2TXmdu. 

In other words, if denotes a family of independent brownian motions, 

the evolution of the system is given by 

dpi = ( - m(i)^qi + mcHqi-i + qi+i - 2qi) - Xpi)du + \/2T\mdWi 

J -1 J 

dqt - m pidu. 



5.4.2 Remark. The system of Definition 5.4.1 is to be thought as a toy model for a large 



class of similar systems obtained by generalizing it in several ways. A first example, 
which would change almost nothing but complicating the formalism, is to replace the 
states space UxU of each particle by IR" x [R'*, or to replace the lattice Z by Z"^. A trickier 
generalization is to consider the case of non-harmonic interactions: then I expect the 
results stated below to remain qualitatively true, but proving them might be far more 
difficult since one cannot use the properties of Gaussian vectors any more. Also, if one 
allows for infinite-ranged interactions, which speed of decay is required to get temporal 
decorrelations? 

All these questions look quite worthwhile to me, though answering them is out 
of the scope of this work. Here I will only show how Hilbertian correlations make 
everything work fine for the toy model, hoping that it shall be useful for the general 
situation. 

Let us consider the equilibrium dynamics of our system. We fix an arbitrary time 
< ^ < oo. Denote by ipi,qi) the state of particle i at time u = 0, resp. by ip'i,q[) the 
state of particle i at time u = t. We have to prove the 



5.4.3 Claim. Provided t is small enough, for all i,j eZ (possibly identical), one has 
{pi:p'j}*,{pi,q'j}*,{Qi'-p'j}*,{Qi,Q'j}* < X uniformly in i,j. Moreover, still uniformly 
in i,j, these quantities are bounded by 0(e~^'-'~^')/br some y > 0. 



Proof. We denote by t] (resp. t]') e [R^'^fP'^' the global state {pi,qi)iez (resp. {p\,q\)iez) 
at time (resp. t). We also denote by {(p^)u^Q the semigroup of operators on K^^'f^'^' 
corresponding to the evolution of the system in absence of noise, but with the friction 
remaining. Since the system is linear, the (p^ are linear operators. 



By the work of § 5.2 , we know that t] is distributed according to the centered Gaus- 
sian law with covariance matrix T~^C, where C is defined as Q~^, the matrix Q being 
in turn defined by: 



QpiPi 

Qqiqi±i 



m"^; (TX) 
m(w^ + 2c^); (TY) 
-mc^, (TZ) 



the other entries of Q being zero. Observe that, as the matrix of a quadratic form, Q 



is bounded (this is obvious from (TXMTZI); moreover, Q ^ (actually exists and) is also 



bounded: that follows from Q's being bounded below by the matrix having the same 
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expression with c replaced by 0, which we denote by Q°, which is a strictly positive 
'scalar' matrix (modulo some homogeneity constant). 

Because of the linear nature of the system, we have moreover that, conditionally 
to T], the law of rj' is some Gaussian vector of the form (p^ij + 0, where is a centered 
Gaussian vector whose law does not depend on 77. Let us denote by C the covariance 
matrix of 0, and Q = — though for the time being it is not clear that Q exists. 

Then, we can formally write the covariance matrix C of (77,77') as C = Q~^, with: 

Qiil,r]') = Qir]) + QiT]'-(p'r]). (UA) 

(Note that Q is a quadratic form on u^^^P'1'P''1'\ while Q and Q were defined 

on K^^f'P''?'). 

5.4.4 Notation. In the sequel, we shorthand "Z x {p,q}" into "Z'*'^", resp. "Z x 
{p,q,p',qr into "Z'^^". 

Now I claim that there exists constants < r ^ R < 00 such that rl^Q ^ RI. Well, 
this is meaningless stricto sensu, because all the entries of Q do not have the same 
physical homogeneity, so we have to 'convert' momenta into positions by dividing them 
by some homogeneity parameter x, say x = Jnco — but other choices may be more rele- 
vant. 

First, I claim that Q ^ ^ix^m~^ Amw^)I. Let indeed {rj,r]') = ipz,Qz,p'z^Q'z^ ^ 
with finite support. We observe that 

IK^, = E (x'^pI + + x'^pI + q1) = Wvf + \Wf, (UB) 

ieZ 

so that either \\r]f^ 5 II (77, 77') 11^ or ||77'||2 ^ ^11 (77,77') 11^. Now, recalling the definition of Q° 



a few lines above, ^(77) ^ Q°ir]) ^ (;t:^m"^ a ma)^)||77l|^, so by (jUAjl, ^(77,77') ^ ix^m~^ a 
mw^)||77l|^. Since reversing the direction of time yields the same system with the sign 
of speeds reversed, which does not change the norms of 77 and 77', one has similarly 
Qir],r]') ^ (x^m"^ Amw^)||77'||^. The claim follows. 

The second point consists in proving that Q is bounded above. On the one hand. 



by (TXMTZ I, 



Q(77) ^ (m~^x^ V m(w^ + 4c^))||77l|2 ^ (m~^x^ v miw'^ + 4c2))||(77 , 77')||^ (UC) 

Next, the difficult point is to prove that ^(77' - ^^77) (exists and) can be bounded 
above by a multiple of 11(77, 77') ||^. We begin with transforming the original problem of 
bounding a quadratic form on IR^ into a problem on . Indeed, ||^*77|| is bounded 
by a multiple of II77II, since the operator (p* dissipates the energy ^(77), energy which 
the previous work on Q proved to be controlled below and above by ||77||^; therefore, it 
suffices to prove that the quadratic form Q(77) on IR^^ is bounded by a multiple of ||77||^ 
to achieve our goal. 

The natural quantity to be computed for 6 (recall that 6 denotes the total effect of 
noise between times and t) is its covariance matrix C. Its expression is the following 
(the notation is explained just below): 

C^2TAm r (p*-npi(p*-''fdu, (UD) 
Jo 
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where Ip is the diagonal matrix being 1 on diagonal entries indexed by some p t and 
on diagonal entries indexed by some qi, and (^^~")^ is the transpose of the linear 
operator (p^~^ seen as a square matrix indexed by Z'*'^. This decomposition means 
that we are summing the contributions of all the elementary noises occuring at times 
u e [0,^], using that these elementary noises are independent. 

Now we need an approximate expression for (/)", u g [0, Here for the sake of 
legibility I will remain at a formal level, giving only limited expansions; it is essential 
nevertheless to keep in mind that all the "0(*)" can be made explicit by using Gron- 
wall's lemma, and that these explicit values ensure that the 0(*) behave well provided 
t is small enough. One finds that 



^''Sq^-Pi 

(P^'SqrPj^i 



u 



(2|j-i|)! 



+ 0{u 



2\j-i\+2 



); 



„2|i-i| + l 

(2|j-i| + l)! 



+ 0{u 



2\j-i\+^ 



); 



O Q 

mo) u + 0{u ); 



2U-i|. 



U 



2\j-i\-l 



mc 



U 



(2|j-i|)! 



I-D! 
+ Oiu 



+ Oiu 



2\j-i\ + l 



); 



2\j-i\+2 



). 



Injecting Equations (|UE[)-(|UI[) into (|UDj), one finds that 

2TAmt + 0it^) 



'PiQj^i 



TXt^ + Oin; 
^TXm-h^ + Oit^); 

Q(^2U-i|+3)_ 



(UE) 

(UF) 
(UG) 
(UH) 

(UI) 



(UJ) 
(UK) 
(UL) 
(UM) 
(UN) 
(UO) 



Consequently, the covariance matrix C can be seen as a perturbation of the ma- 
trix C° which is defined by Equations (UJMUOl, but with the "0(*)" terms replaced 
by 0. Since C° is invertible, with an explicitly computable inverse, one finds that C is 
invertible too with: 



QpiPi 

QpiQi 

QpiPj^i 

QpiQj^i 



2T-^X-^m~^t~^ + 0it); 
-ST-^-^r^Od); 
6T-^X~^mt~^ + 0it~^y, 
0(^21^-^1-^); 
0(^21^-1-2); 

0(^2U-i|-3)_ 



(UP) 
(UQ) 
(UR) 
(US) 
(UT) 
(UU) 



In the end, provided that t is small enough, we have proved that QiT])/\\T]\\' 

er-^mr^ + o(r i)<oo. 

['^Recall that C, as a covariance matrix, is symmetric. 
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Actually we have proved more than that: not only we have a bound on the operator 
norm of Q, but we have bounded it entry-wise. More precisely, expanding the Oi*), we 
find that provided t is small enough, there exists constants A < oo and 7 > such that 
for all i,j eZ, 

-r\J-i\^ (UV) 



all 16 possibilities 



5.4.5 Notation. From now on we denote the basic variables Pi,qi,p\,q'i of our system 
byZ,,/GZ«4_ 



Now the question is: for i^jE Z^'^, if c Z^"^ - {i,j}, what is the value of {Xi : Xj}-^? 



By the properties of Gaussian variables [Theorem |1.2.6[ , the answer is the following. 
Let Q\^^^ ^ be the restriction of Q to indexes in (Z^^-^-K). Since rl^Q ^ RI, the same 

holds for Q|^y4 ^, so this matrix is invertible; denote by C' its inverse. This matrix 
is the covariance matrix of (the centered version of) X^iti4^j^ under some fixed value 
for Xjc', thus: 

c ' 



II JJ 



(UW) 



It remains to control the entries of C' , uniformly in K. We need two types 
of control: first an exponential control when i is far away from j, then a non-trivial 
control for the values of i and j corresponding to close (or even identical) atoms. 



Let US start with the first one. C . . and C . . are bounded below by i? ^, sowe 



JJ 



just have to bound above C 
appendix. 



ij 



This is achieved by a direct use of Lemma 



5.5.1 



m 



Concerning the uniform non-trivial control, since rl^Q ^ RI one has rI^Q\ ^ 



RI, hence R'^I ^ C' 



^ r-^I, hence i?-^! ^ (Cl '^)| 

ViT^jG/ {Xi-.Xj} ^ 



R + r 



r ^I; from this and (UWI, 



(UX) 

□ 



From Claim 5.4.3 we get the main result of this subsection: 



5.4.6 Theorem. For the model of Definition 5.4.1 for all t>0, {77,77'} < 1 



Proof. First, if t is small enough so that Claim 5.4.3 holds, direct application of 



Lemma 3.6.8 proves the result, as the are summable (since they decrease 

exponentially) and they all are < 1. 



Now for larger t, fix some < < i so that Claim [5l4.3 holds for ^i. Then we notice 
that 77 77(^1) ^ 77' is a Markov chain (with "rjitiT standing for "ipiti),qiti))"), so by 
Proposition |1.1. 13 [ {77,77'} ^ {77,77(^1)} < 1. □ 
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5.5 Appendix: Inverses of 'nearly diagonal' matrices 



The goal of this appendix is to state and prove a few lemmas sharing the same spirit: 
"i/a matrix is 'nearly diagonal', then it shall be invertible and its inverse shall also be 
'nearly diagonal' with the same type of decay". 



5.5.a Matrices with exponential decay 

The goal of this subsection is to prove the following 

5.5.1 Lemma. Let / c Z and let ((Afij))(jj)e72 be a matrix. Assume that, when seen as 
a quadratic form on L^{I), one has rl^M ^ RI for < r ^ R < oo — in particular, M is 
invertible. Assume moreover that there exists constants A <oo and j> such that for 
all ijel, \Mij\^Ae-^^J-'^. 

Then there exist constants A' <oo and y' > which are explicit functions of r,R,j,A 
(so they do not depend on I), such that one has the following control on the entries 
ofM-^: 

Mijel (M-%-^A'e-^'l^-^l. (UY) 



Proof. Up to multiplying by a scalar, one can assume that R = 1. Then M writes 
M = I-H, where ^ H ^ (1- r)I; since H is symmetric, that inequality means that 
\\\H\\\ ^ 1 - r < 1. Therefore, for all yfe £ N one has ^ (1 - rf, which allows us to 

write as a series expansion: 

oo 

M"^=^H^. (UZ) 

^=0 



Up to replacing A by A + 1, we have the same entry-wise control on H as on M. 
Then one sees by induction that for all k eN, 

Mijel \{H%\^A\e-'^^\^-'\ (m) 

where jl is an arbitrary parameter in (0,7) and 

Ai := y Ae-^l^l+^1^ = ^^^^ , ^ (VB) 

^tz (l-e-<r-ri))(i-e-(r+ri)) 

— observe that / does not appear in the expression of Ai. Since Ai is greater than 
1, (VAI is not enough to get an entry-wise control on M~^. But now observe that the 
bound III//* III ^ (1-r)* implies that all the (H^ij are bounded by (1 - r)^ in absolute 
value; thus: 



\iM-\j\ ^ £ (e-T-il^-^lAj A (1 - r)^) ^ — ^ + - exp 



|log(l-r)lri ] 



|log(l-r)|+logAi 



(VC) 

from which you read suitable values for A' and 7'. □ 
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5.5.b Convolution inverses of rapidly decreasing functions 

In all this subsection, we work on Z"- for some n eN*; U"- is endowed with some 
fixed norm \-\. 

5.5.2 Remark. Here I will deal with fonctions on Z", but the results of this subsection 
could also be tranposed for functions on IR". 

5.5.3 Definition. If a : Z" ^ [R is some integrable function with ||a||;i < 1, we define 

B[a] = a + a*a + a*a*a + --- , (VD) 

which is the sum of a convergent series in ZHZ"). B[a] is the function b e ZHZ") char- 
acterized by: 

i6o-a)*i6Q + b) = do. (VE) 

5.5.4 Definition. A function a : Z" ^ K is said to have exponential decay if there exists 
some p > such that, for all 13' < (i, a{z) = 0{e~^'^^^) when \z\ oo. The minimal ^ 
satisfying that property is called the (exponential) rate of decay of a. 

5.5.5 Lemma. Let a £ l^iZ"') with ||a||^i < 1. If a has exponential decay, then so does 
Bial 



Proof Denoting by \a\ the function defined by \a\iz) - \aiz)\, it is clear by ( [VD| ) that 

V2GZ" \B[a]iz)\^B[\a\]iz), (VF) 

therefore it suffices to prove the case where a is nonnegative. In that case, B[a] will 
also be nonnegative. 

Let (IR")* denote the dual space of IR", endowed with the dual norm 

VAg(K")* |AU = supKA,0>|. (VG) 

For a nonnegative function a, we define its Laplace transform if {a} : (IR")* ^ IR+ u {+00} 
by 

if{a}(A)= e^^'^^aiz). (VH) 

zeZ" 

Then, saying that a has exponential decay with rate 7 is equivalent to saying that, for 
all A G (R")* with |AU < J, ^{a}(A) is finite. 

Since Laplace transform is linear and turns convolution into ordinary product, ( |VD[ ) 
yields, for all Ag(F8")*: 

S^{B[a]KX) = S£{a}{X) + S£{a}{Xf + S£{a}{Xf + • • • , (VI) 

which converges if and only if £^{a}(X) < 1. 



Now, since a is nonnegative, by (jVHj) the function if {a} is convex, so it is continuous 
on the interior of the domain where it is finite. By the exponential decay hypothesis, 
that domain contains a neighbourhood of 0, so if {a} is continuous at 0. And since 
if{a}(0) = Z2eZ"'^(2) = ||a||;i < 1, there is a neighbourhood of on which if {a} < 1 and 
thus i?{fi[a]} < 00. This implies that B[a] has exponential decay. □ 
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5.5.6 Remark. This proof also shows that (for nonnegative a) the rate of decay of fi[a] 
will never be greater than the rate of decay of a. In general, it is even strictly smaller, 
since all the values of A for which 1 ^ £^{a}iX) < oo yield a finite Laplace tranform 
for a but an infinite one for B[a]. For example, take n = 1 and a = e~^6i, which has 
exponential decay with infinite rate since it is compactly supported; then the ^-th 
convolution power of a is a*^ = e~^dk, so that B[a] is the function 

5[a](^) = l2>oe"^ (VJ) 
which also has exponential decay, but with rate 1 only 

5.5.7 Lemma. If WaWmj^n^ < 1 and a(z) = 0(l/\z\") when \z\ oo for some a> n, then 
B[a]iz)= Oil/\zn when \z\ oo. 



Proof. Let a satisfy the assumptions of the lemma for some a. Like in the proof of 
Lemma 5.5.7 , we can assume that a is nonnegative. For d > 0, we define the func- 
tion : Z"^IRby: 

ipdiz)--=yi\z\Adr, (VK) 

which is in Z^(Z") since a> n. Then the key claim is the following sub-lemma, whose 
proof is postponed: 



5.5.8 Lemma. Under the assumptions of Lemma 5.5.7 there exists some p < 1 and 
some d g (0,oo) such that, pointwise. 



(pd*a^p(pd. 



(VL) 



Admitting Lemma 5.5.8 take p and d such that ( |VL| ) is satisfied. The assumption 
on a implies that there exists some C < oo such that a^Ccpd', therefore by (|VL]| one also 
has a * a ^ Ccpd * a ^ pCcpd, whence by (VLl again a * a * a ^ pC(pd * a ^ p^C(pd, etc.. In 
the end, 



B[a] ^ C(pd + pCcpd + p C(pd + ' 
which implies thatB[a](2) = 0(1/|2|"). 



1-p 



(Pd, 



(VM) 



□ 



Proof of Lemma p. 5. 8\ Denote S ■= \\a\\ii, which by hypothesis is < 1, and fix e g (0, 1/2) 
such that (1 - e)" > S. Let d g (0,oo), devised to be quite large; our goal is to bound 
above {(pd * a){z) for all z g Z". Since (pd is bounded above by d~", one has obviously 
for all 2 gZ": 

(cpd * a)iz) ^ d-" ^ aiz) = (VN) 

zeZ" 

whence (cpd * a)(z) ^ S(pd(z) for all z with \z\ ^ d. Since S < 1, the claim is therefore 
okay for \z\^d. 

Now, let 2 G Z" with \z\> d. We have to bound above 

i(Pd*a)iz)= <Pd(x)a(y). (VO) 

x+y=z 

We decompose this sum into three pieces: 

i(pd*a)iz)= ^ (pdiz- y)aiy)+ ^ (pdix)aiz-x)+ ^ (pdix)aiz - x), (VP) 

\y\^e\z\ \x\>£\z\ |x|se|2| 

|2-X|>c|z| 
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which we shorthand into "® + (2) + (3)". 

We bound these three terms separately. For (T), we observe that for \y\ ^ e\z\, \z-y \ ^ 
by the triangle inequality, thus (Pd(z-y) ^ ((1 - e)\z\) " = il-e)~"(pciiz), whence 
by summing: 

®^il-E)-''(pdiz) aiyJ^a-eVSipdiz). (VQ) 

\y\^e\z\ 

Similarly, for \x\ ^ e\z\, C denoting a constant such that a ^ Ccpd, one has a{z-x) 
^C((l-£)|0ir", thus: 

(^^{l-eyCWtpdWiiiPdiz)- (VR) 

Of course, depends on d; the important point is that, by dominated convergence, 

llV'dll/i ~^ when d ^ oo. 

Finally, provided d is large enough. Term (2) will be well approximated by an inte- 
gral: 

1 c r c 

@ ^ y X =^ / dx, (VS) 

^^jn \X\ \Z XI J\ocl\z-x\>e\z\^^^^^^ 
\x\,\z-x\>e\z\ 

where "-" means that the ratio between the quantites at each side of that symbol can 
be made arbitrarily close to 1 when d oo, uniformly in z. Indeed, the difference be- 
tween the sum and the integral is due to two causes: first, approximating the integral 
on a unit square of IR" by the value of the integrand at the center of this square, sec- 
ond, summing (or not summing) terms of the discrete sum corresponding to squares 
that are not entirely in the domain of the integral. For the first cause, on the domain 
of the integral, C/(|x||2 -x|)" varies of at most 0{ll\z\) in relative value on all the unit 
squares. For the second cause, the border of the domain of the integral is made of two 
in - l)-dimensional spheres of radius £\z\, so it crosses 0(12^-^) unit squares. Since 
Cl{\x\\z-x\)"- is bounded by C{e{l - e))""!^!"^" on the domain of the integral, the (ab- 
solute) error due to boundary squares is at most 0{\z\"'~^~'^°^). As the integral itself 
is proportional to \z\"~^" (cf the change of variables below), the relative error due to 
boundary squares is at most 0(l/\z\) too, and 0(1/1^1) = o(l) since \z\ > ed. 



Making the change of variables x = l^lx', (VS I becomes: 



I < 



C\zr^" f — —dx', (VT) 

j\x'\,\i-x'\>e\x'ni-x'r 



which I shorthand into "(2) < j^Cl^r-^"". Since \z\>d and a> n, this bound implies: 



J' 



^^^Vdizl (VU) 



Combining ( |VQ] ), ( |VR| ) and (|VU]), one finally gets that when d ^ oo, for all \z\ ^ d, 

i(pd*a)(z)^pid)cpd(z), (VV) 

with 

pid) = (1 - er"iS + C||a||;i) + (1 + o(l))J?C/d«-". (VW) 

pid) tends to (1 - £:)~"S < 1 when cZ ^ co, so it is < 1 provided d is large enough, which 
is what we wanted. □ 
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